FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CAT AC AT T AC T GCAG T AACAC T CCAC CAT ATAGACCC GGC T T TAC CT TATAT CAG T GACAC T GG 
T ACAGTAGCTCCAGAAAAATGCT TAT T TGGGGCAATGCTAAATAT TGCGGCAGT TT TATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
H : TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
Jf CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
2 TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
|f| ATCAGT T TT GCACAGT GGCAAT T TT GGGACT GAT T T AGAACAGAAACT CCAT T GGAACCCCGAGG 

W ACAAAGGT TATGTGCT T CACATGAT CAC TAC T GCAGCAGAATGGT CT ATG TCATTT TCCT TC T TT 

■*« GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
JU TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
m GAGATATT TGA TGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 
□ TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT GAAT GC T GAT AAT C AGG AAAC AT G AAAGAAGC CAT T T GAT AG AT TAT T C T AAAG GAT AT CAT 
CAAGAAGAC TAT TAAAAACACCTAT GC C TAT ACT T T TT TATC TCAGAAAATAA&GT CAAAAGACT 
ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 

LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAAT GT GACTGACACGGATATT CTAGCCCT GGAACGGCGACT GCT GCAAACCATGGATATGAT 
CAT AAGCAAAAAGAAAAGGAT GGCAAT GGCACGGAGAACAATGT T CCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCTAC CAAGGAGAGAATAGAATACTCCAAAACCTT CAAGGGGAAATAT TTTAATT T TCT TGGT T 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCT GTCACAAGAGGCAT TGAGAT CACT GT GAATTAT CTGGGAAT CCAATTT GATGT 
GAAGT TT TGGTCC CAACACATTTCCTT CATT C T TGTT GGAATAAT CAT CGTCACATCCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGT CCTT GGAGAACT GCAGT T CAACTTCTAT CACCGT TGGTTT G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCACCT TGAACT TAAGC CTACTACAGACT GTTAGAGGCCAGTG GTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACAT GAGAGGGAGAACTAAC TCAAGACAAT ACTCAGCAGAGAGCATCCCGT GTGGATATGAGGCTGG 
TGT AGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAAT ACACTGGAAC T CTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAA?W\AAAAAAAAAAAGGGCGGCCGCG 
ACT CTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACT TGTT TAT TGCAGCTTATAATG 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNS S SRY FHWKMNLCVI LL I LVFMVP FY I GYFI VSNI RLLHKQRLLFSCLLWLT FMY FFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
L FLE TADL YATKER I E YSKT FKGKY FNFLGYFFS I YCVWKI FMAT IN I VFDRVGKT DPVT RG IE I 
TVNYLGIQFDVKFWSQH I S FI LVG 1 1 1 VTS I RGLL I TL TKFFYAI S S SKS SNVI VLLLAQ IMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGELQ FNF YHRW FDVI FLVSAL S S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTGCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCGATCCAGGT 
GT CAT GCAGAAT TAT GGGGATCACCCT T GTGAGCAAAAAGGCGAACCAGCAGC T GAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
H ATTATCACCACCAAAGATCCCATAT TCAACACTCAAACTGCAACACAAACAACAGAATT TATTGT 

p CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
fl CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
Jj? GAAACTAGCAC CAT GTC TACAGAAACTGAACCATT TGT T GAAAAT AAAGCAGCAT T CAAGAAT GA 

AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
III CAGC T GGT CT T GGAT T T TGC TATGT CAAAAGGT AT G TGAAGGCCT T CCC T TT TACAAACAAGAAT 

ill CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
f s | T GAGGAATCAAAGAAAAC TGATAAAAACC CAGAAGAGTCCAAGAGT CCAAGCAAAAC TACCGT GC 

'% GATGCC T GGAAGCT GAAG T T TAGAT GAGACAGAAAT GAGGAGACACACC T GAGGCTGGT TTCT TT 

~ : P CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
5 CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
r| GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
|r| TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
^ CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
y AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
ill GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
|!| GGCAGAAAAGT T TAGCCC CT GAAAGCCAT GGAGAT T CTCATAACT T GAGACC TAAT CTC TGTAAA 

ff j GCTAAAATAAAGAAATAGAACAAGGCTGAGGAT ACGACAGTACACT GT CAGCAGGGACT G T AAAC 

" ACAGACAGGGT CAAAGTGTT TTCT CTGAACACATTGAGTTGGAATCACTGTT TAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAACTCTTATAAATTTCTATTTTTATCTGAGTTACAGAAATGATTACTAAGGAAGATT 
ACTCAGTAATTTGT TTAAAAAGTAATAAAATTCAACAAACAT TTGCTGAATAGCTACTATATGTC 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TT T TAT T T T T GC TGAGAC TAAT CT TAT TCAT T T TC T CTAATAT GGCAACC AT TATAACC T TAAT T 
TAT TATTAACATAC CTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATT TTAAAAGTGCC 
ATTAACAAATGTAT CACTAGCCCT CCTT TTTCCAACAAGAAGGGACTGAGAGATGCAGAAATAT T 
TGTGACAAAAAATTAAAGCATTTAGAAAACTT 
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FIGURE 6 



MARCFS LVLLLTS I WT TRLLVQGS LRAEELS I QVSCR IMG I TLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQT'TEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVIALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GC T CT CAGC TACCCGCAGGAGGAGGCCACCC TCAATGAGAT G T TCCGCGAGGT TGAGGAAC T GAT 
GGAGGACACGCAGCACAAAT T GC GCAGCGCGGT GGAAGAGAT GGAGGCAGAAGAAGCTGCT GC T A 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
AC GAAGGT TGGAAAT AATACCAT C CAT GTGCACCGAGAAATT CACAAGAT AACCAACAACCAGAC 
TGGACAAATGGT C T T T T CAGAGACAGT TATCACAT C TGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAG T GCATCAT CGACGAGGAC T GTGGGCCCAGCATGTACT GCCAGT T TGCCAGC T TC CAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGC T AGT GAAGACCAGAGCAGT T TCATC T GGT T GTGAC TC TAAGCT CAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTC7\AACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
T ACAGGT TAACCT GCAGAAACAGT AC T TAGGTAAT T GTAGGGCGAGGAT TATAAAT GAAAT TTGC 
AAAATCACT TAGCAGCAACTGAAGACAAT TAT CAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGT GTGAAACATGGT TG TAATATGCGAC T GCGAACAC T GAACT CT AC GC CACT C CACAAAT GAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA 
AT CAAG C AT AAAT C AC T T C AAC T G C AAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SEVNLANLPPS YHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVI T SVGDEEGRRSHEC 1 1 DEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDS ECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCAS GLLCQPHSHS LVYVCKPT FVGSRDQDGE I LLPREVPDE YEVGS FMEEVRQELEDLE 
. . RS LTEEMALGEPAAAAAALLGGEE I 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 



Signal sequence : 




N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 



amino acids 1-19 



327-331 



fi 



N-myristoylation site. 

amino acids 202-208, 217-223 



Amidation site. 



amino acids 140-144 
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FIGURE Q 

CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAA.TCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT T GACCACAAGC T CAGAGGAGTGCAGAACCC TGTAGCCCGC T GCATCATGTGC TGT T T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACGTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATT T TAT TAAAGATAT T T TG T TAAC TC 
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FIGURE 10 



RTRGRTRGGCEKVP INT SCNPTAHLWS SCPGLMCVFQGYS SKGL I QRSVFNLQI YGVLGLFWTL 
NWLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EY I DHKLRGVQNPVARC IMCCFKCCLWCLEKFI KFLNRNAYIMIAI YGKNFCVSAKNAFMLLMRN 
I VRVWLDKVT DLLLFFGKLLWGGVGVLS FFFFS GR I PGLGKDFKS PHLNY YWL P IMT S I LGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites . 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Ami da ti on site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCG6CGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
l . GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
Q ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
Q CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
01 GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

m TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

III 

r: CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

%« CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

5 TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

Q GGGCAACGAGACAGTT GTGGCAGGCCCC GAGGGCTAT GAGACCCAGT GGTGGGAT GC CCCGAGCATTG 

111 TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

*** AACAGCCTGAT GCAGACCGAGGAGT GCCCACCTAT GC TAGACGCCACACAGCAGCAGCAGCAGCAGGT 

?= 5 

jj;J GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

»t ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

III' 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPR?VAIQ 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 

ECDSRAWYAGLFFFTLLFYLLSIAAVAmFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 

QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 

VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 

SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWKICASWAGLLLYLWTLVA 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCAT TGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTAT TCTTCATAT 
t . TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
-3 ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
Q TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
JJJ CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 

I I GT CAGT T T TAT T CC CC TCCTGAATCCGAAGCAGGAT CTGAAGAAGGTGAAGAAAAACAGGACAGT 
jZ GAGAAACCAC TTTTAGAACTATGAGTAC TACT TT TGTTAAATGTGAAAAACCCT CACAGAAAGTC 
3- ATCGAGGCAAAAAGAGGCAGGCAGT GGAGTC T CCC T GT CGACAGTAAAGT TGAAAT GGT GACGT C 

CAC T GC TGGC T T TATT GAACAGCTAATAAAGATT TAT T TAT TGTAATACCTCACAAACGT TGTAC 

If! 

catatccatgcacatttagttgcctgcctgtggctggtaaggtaatgtcatgattcatcctctct 
in. tcagtgagactgagcctgatgtgttaacaaataggtgaagaaagtcttgtgctgtattcctaatc 

HI 

p aaaagacttaatatattgaagtaacacttttttagtaagcaagatacctttttatttcaattcac 

III agaatggaatttttttgtttcatgtctcagatttattttgtatttcttttttaacactctacatt 
tcccttgttttttaactcatgcacatgtgctctttgtacagttttaaaaagtgtaataaaatctg 
acatgtcaatgtggctagttttatttttcttgttttgcattatgtgtatggcctgaagtgttgga 
cttgcaaaaggggaagaaaggaattgcgaatacatgtaaaatgtcaccagacatttgtattattt 
ttatcatgaaatcatgtttttctctgattgttctgaaatgttctaaatactcttattttgaatgc 
acaaaatgacttaaaccattcatatcatgtttcctttgcgttcagccaatttcaattaaaatgaa 
ctaaattaaaaa 
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FIGURE 14 

MimLPEDMENALTGSQSSHASLRNIHSINPTQmARIESYEGREKKGISDTORTFCLFVTFDLL^ 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
jLis Signal peptide: 

%J amino acids 1-20 

^l' Transmembrane domains: 

m amino acids 54-72, 100-118, 130-144, 146-166 

s 

«)* N-myristoylation sites. 

, amino acids 14-20, 78-84, 79-85, 202-208, 217-223 

if! 
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FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
u AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
O GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
0 TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
m CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
ill TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
j^f TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
III CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
J* CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
|f ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
tfl GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
P GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
m AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
Jjj GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
y GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
111 GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCGGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACAC CACGTAAGT TC T CAGT C CCAACCT CGGGGAT GTGTGCAGACAGGGC TGT GT GAC CACAGC T 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T T AGG AAC AT GTTTTGCTTTTT T AAAAT AT AT AT AT T T AT AAG AG AT C C T T T C C CAT T TAT T C T 
GGGAAGATGT T T T T CAAACTCAGAGACAAGGACT T T GGT T T T TGTAAGACAAAC GATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCH FPPKNAGRLLLELDYADFGCPAT TT TATVPT TRPWRE PTALSS S LAP TWLS PTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAAIAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 



APP ID=1 0063534 



Page 166 



FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCT GATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAAT CCTTAGAT TCCAAGACTACTT TGACATCAGAT GAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
AT TAGAATCC TC TATT CAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGAT AT C AGC T T T C T AGAG T C T C C AAAT C C AGAAAACAAG GAC TAT GAAGAGC C AAAGAAAG T A 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TT T CC TAGATAAGGAGTATGATGAATGTACAT CAGATGGGAGGGAAGATGGCAGACT GT GGTGTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAAC TGGAATGAAAATCCT TAATGGAAG 
CAATAAGAAAAGC CAAAAAAGAGAAGCATAT CGGT ATC T CCAAAAGGCAGCAAGCATGAACCAT A 
CCAAAGCCCTGGAGAGAGTGTCATATGCTGTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
J* GCAGCGAGAGAGAT GT T T GAGAAGC T GAC TGAGGAAGGC TCT CCCAAGGGACAGAC TGC T C T TGG 

a CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
W TT GGAGC TC T TGGGGGCAAT C TAATAGCCCACAT GGTT T TGGTAAGT AGAC T T TAGT GGAAGGCT 

^5 AATAATAT TAACAT CAGAAGAAT T T GT GGTT TAT AGCGGCCACAAC T TT T TCAGC T T TCAT GAT C 

P| CAGAT T TGC T T GTAT TAAGAC CAAATAT T CAGT T GAAC T TCC T T CAAAT TCT T GT TAATGGATAT 

O AACACAT GGAATC TACATGTAAAT GAAAGT T GGT GGAG TCCACAAT T T T TCT T TAAAAT GATT AG 

flj TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTG TCAGAAT CAT T T T T TACAT TAGAT TATCAT AAT T T TAAAAAT T T TT CT T TAGT T T TT CA 
AAAT T T T G T AAAT G G T GGC TAT AGAAAAAC AAC AT GAAAT AT T AT ACAAT AT T T T G CAACAAT G C 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPEMKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASmHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
G TATT ATACAAGAAAGGAGTGTACC TATCACACACAGGGGGAAAAATGC TCT T T T GGGT GCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGAT T TCAT G TAAT CGCT GCCT GT CT GACT GAATCAGGAT CAACAGC T T T AAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
C G CAG T GGG T GAAGAAC C AAG T T G GGGAGAAAGG T CTC T GGGG TCT GAT C AAT AAT GCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCT GT T T GGACT CATCAGTGT GACACTAAATAT GCTTCC T T T GGT CAAGAAAGCT CAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
AT GCAT TGAACCAGGAT TGT T CAAAACAAAC T T GGCAGATCCAGTAAAGG TAAT TGAAAAAAAAC 
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGA7VA 
GCAGAGCTGGC TAAT C CCAAGGCAG T G TGAC T CAGCTAACCACAAATGTC TCC TCCAGGCTATGA 
AAT TGGCCGAT TT CAAGAACACAT CTCCTTTT CAACCCCAT T C CT TATC T GC T CCAACC TGGAC T 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCC T GC T CAAGTT T T C T T TGAAAAGGAGGGC T GGAATGGTACATCACATAGGCAAGTCCT GCCC T 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CATC T C T TAT CTAAATAT TAAAAGATAAGT CAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLI3SfNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVS C I EPGL FKTNLADPVKVIEKKLAI WEQLS PD I KQQYGE G Y I EKS LDKLKGNKS YVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLS™^ 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites . 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGC ATG GAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT TAC T GAT TCCCAAATGGAT GAT GT TGAAGT TG T TT AT ACAAT TGA 
CAT TCAGAAAT AT AT T CCATGCTAT CAGC T T T T T AGC T T T TATAAT TCT TCAGGCGAAGTAAAT G 
AGCAAGCACTGAAGAAAATAT TATCAAATGTCAAAAAGAATGTGGTAGGT TGGTACAAAT TCCGT 
CGT CAT T CAGAT CAGAT CATGACG T T TAGAGAGAGGC T GCT T CACAAAAACT T GCAGGAGCAT T T 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
AT CGAC T GGAACAT T CC T T ATATAAACC TCAAAAAGGAC TT T T TCACAGGGTACCTT TAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTTTAGCCGAGCAGTACAAACACACAGCTCTAAATTTTTTGAAGAAGATGGATCCTTAAAGGAGG 
TACATAAGATAAATGAAATGTATGCTTCATTACAAGAGGAAT TAAAGAGTATAT GCAAAAAAGTG 
GAAGACAGT GAACAAGCAGTAGATAAACTAG TAAAGGATG TAAA.CAGAT TAAAACGAGAAAT T GA 
GAAJAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAAT C T GACCT TAAT GGTAGAACACACTGACAT TCC TGAAGC TAGT CCAGCT AGTACACCAC 
AAATCATTAAGCATAAAGCC TTAGACT TAGATGACAGATGGCAAT TCAAGAGATCTCGGTTGT TA 
GATACACAAGACAAACGATC TAAAGCAAATACT GGTAGTAGTAACCAAGATAAAGCAT CCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 
CTACAT TT TGAT C CT T T TAACC TT ACAAGGAGAT T T T T T TAT T T GGCTGATGGGTAAAGCCAAAC 
AT T T CTAT TGTT T T TAC TATGT TGAGC T AC T TGCAGTAAGT T CAT T T GT T TT TACT AT GT T CACC 
TGTTTGCAGTAATACACAGATAAC TCTTAGTGCATTTACTTCACAAAGTACT TT TT CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT TCT T TAGAAT T GGAAAAG TGAGACCAGGCACAGTGGC TCACACC TGTAAT CCCAGCAC T 
TAGGGAAGACAAGTCAGGAGGATT GATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTAT T 
GAGACCATGTC TAT TAAAAAAT AAAATGGAAAAGCAAGAATAGCCT TAT T T T CAAAATAT GGAAA 
GAAAT T TAT AT GAAAATT TAT CTGAGTCAT TAAAAT TC TCC T TAAG T GAT ACT T T TT TAGAAGT A 
CAT TAT GGC TAGAG TTGC CAGAT AAAAT GC TGGAT ATCATGCAATAAAT T TGCAAAACAT CATC T 
AAAATTTAAAAAAAAAAAZ^\AAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE IEKMKGFGEYSRS PTF 



Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 



N-myristoylation site. 

SI. amino acids 184-154 

o 

fll Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 22 



Li 

f 7 

- ■: 



5 5 S 

□ 



GG CACAGCCGC GCG GCGGAGGGCAGAGTCAGCCGAGCCGAGT CCAG CCGGAC GAGC GGACCAGC GCAGGGCAGCC CAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACC ATG GCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGAT GC C CCCCT CCACGAGAT CAAC GGTGAT CATTT GAAGAT CT GT CC CCAG GGTT CTACCT GCTG CT CT 
CAAGAGATGGAGGAGAAGT ACAGC C TGCAAAGTAAAGAT GAT TT CAAAAGT GTGGTCAGC GAACAGT GCAAT CAT TT G 
CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AATGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTAGAGTTG 
AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
TTCCGCCXGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
T GTTT GGC CAAC CAAGGGGATCTCGATTT T GAAT GGAACAAT TT CAT AGATGCTAT G CT GAT GGT GGCAGAGAG GCTA 
GAGGGTCCTTTC^ACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
AATAGTGT T CAAGT GTCT CAGAAGGTT TT C CAGGGAT GT GGACC C CC CAAGC C C CTCCCAGCT G GACGAATTT CTCGT 
T C CATCT CT GAAAGT GCCT T CAGT GCT C GCTT CAGAC CACAT CAC CC CGAGGAACGCCCAACCACAGCAGCT G GCACT 
AGT T T GGAC CGACT GGTTACTGAT GT CAAGGAGAAACT GAAACAGGC CAAGAAATTCT G GTCCT CCCTT CCGAGCAAC 
GTTT GCAAC GAT GAGAGGAT GGCT GCAGGAAAC GGCAAT GAGGAT GACTGTT GGAATGG GAAAGGCAAAAGCAGGT AC 
CT GTTTGCAGT GACAGGAAATGGATTAGCCAAC CAGGG CAACAAC CCAGAGGT C CAGGTT GACACC AGCAAAC CAGAC 
ATACT GAT C CTT CGT CAAAT CAT GGCT CT T CGAGT GAT GACCAGCAAGAT GAAGAAT GCAT ACAAT GGGAACGACGT G 
GACTT CT TT GAT AT CAGT GAT GAAAGTAGT GGAGAAGGAAGTG GAAGT GGCT GT GAGTAT CAGCAGT GCCCT T CAGAG 
TT T GACTACAAT G C CAC T GAC CAT GCT GGGAAGAGTG C CAAT GAGAAAGCCGACAGT GCT GGT GT CCGT C CT GGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAG ATAA TTCTCAAACTCTGAG 
AAAAAGT GTT CAT C AAAAAGTTAAAAGGCACCAGT TAT CACTT TT CTACCAT CCTAGT GACTT T GCT TT TTAAAT GAA 
TGGACAACAATGTACAGTTT TTACTAT GT GGCCACT G GT TTAAGAAGT GCTGACT TT GT T TT CT CAT T CAGTTTT GGG 
AGGAAAAGGGACT GT GCATT GAGT T GGTT CCT GCT CC CCCAAAC CAT GTTAAAC GTGG CTAACAGTGT AGGT ACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTT T TTT C CAACT GT GAT CT CG CCTT GTTT CT TACAAGCAAACCAGGGT C CCT T CTT GGCACGT AACAT GTAC GT ATT 
T CT GAAATATTAAAT AGCT GTACAGAAG CAGGT T TTATT TATCATGTTAT CTTAT TAAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTl^VLSAALIJ^LKSKSCSEWRLYVSKGraKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRABVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWN^ 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

f*% KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

I j KSANEKADS AGVRPGAQAYLLT VFC I L FLVMQRE WR 



Important features: 
Signal peptide: 

amino acids 1-22 



ATP/GTP-binding site motif A (P-loop) 



J;{j amino acids 515-524 



N-glycosylation site. 

fll amino acids 514-518 



i?3 



Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypioans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C T GAGT CATC CCCAGGGATCAGGAGCC T CCAGCAGGGAACCT T CCAT TATAT T CT T CAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAAT GT GAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT T CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTT CCAAT TAAACAT TC T CAGCCAAGAAGACAG TGAGCACACC TACCAGACAC T C 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATT TAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATAT TT TT 
AAATGTCAAAAAAAAAAAAAAAAAA 



flJ 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 



APP ID=1 0063534 



Page 176 



FIGURE 27 



GGACGCCAGCGC C T GCAGAGGCTGAGCAGGGAAAAAGCCAG TGC CC CAGCGGAAGCACAGC TCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
IZ GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
fh CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
ff! TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
*U GAGCCGACC TGGAAACACATT GGGGAT GGC T GCT GCCT CACCAGAGAGACCT GGAAGGATCTTGA 

*|| GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
~ T T CCCCAGCCTCCAAT TAGAACAAGCCACCCACCAGCCTAT CTATC TT CCACT GAGAGGGACC TA 

D GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
W? AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
J! AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
J5 CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
fjj CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGGCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 20 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TAT GAT T C T TAT ACT AAT T TATACAAAGAT AT TAAGGCCC T GT T CAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



Q 

91 



ru 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 
amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE ftl 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T TGCATC T GTT T T GATAAATGAT GT T GACACCCT CCACC GAAT T CTAAGT GGAAT CATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
T TCAGAT TCAT CAC CAC C CT T CT GGT TCACAT T T T CAT T TCAT T GGT TAT TT TGGGAT T GT TGT T 
T GT C TGCGGTG T T T TATGGTGGCT GTAT TAT GAC TAT AC CAACGACCTCAGCAT AGAAT T GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
IP TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
m ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
J!S AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
*F TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
3 TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAATCAT TG TCATGTACATGCAAAACGCAC T GAAAGAACAGCAGCATGGT GCATTG T 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCAT ATAC T ACAACT GCTAT TAAT GGGACAGAT T T CT GTACATCAGCAAAAGAT GC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
||{ TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GAT GGATCGT CAGAAAAGCCCTAC T T TATGGAT CAAGAAT T TC TGAGT T TCGTA 
AAAAGGAGCAACAAAT TAAACAATGCAAGGGCACAGCAGGACAAGCAC TCAT TAAGGAAT GAGGA 
GGGAACAGAACT CCAGGC CAT T GTGAGATAGAT AC CCAT T TAGGTATC TGTACCTGGAAAACAT T 
T CCT T C TAAGAGCCAT T TACAGAAT AGAAGAT GAGACCAC TAGAGAAAAGT T AG TGAAT T T T T T T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 



m 
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FIGURE 32 



MSGRDTILGLCILALALSLA^FTFRFITTLLWIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGI TAVLLVL I FVLRKRIKLTVELFQI TNKAI SSAPFLLFQPLWTFA 
I L I FFWVLWVAVLLS LGTAGAAQVMEGGQVE YKPLS GI RYMWS YHL I GL I WTSE FI LACQQMT I A 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 



pi Important features: 

pfl Signal peptide: 

sM amino acids 1-20 



Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 



f|| N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3 3 



5S 

m 



J 



GT T CGAT TAGCT CC T C T GAGAAGAAGAGAAAAGG T T CT TGGAC CT C TCCC TGT T TC T TC C T TAGA 
ATAAT T T GTAT GGGAT T T GT GAT GCAGGAAAGCC TAAGGGAAAAAGAATATT CATT CT G T G TGG T 
G AAAAT T T T T T GAAAAAAAAAT TGCCTTCTT CAAAC AAGGG TGT CAT T C T GATAT T TATGAGGAC 
TGTTGTTCT C AC TAT GAAGGCAT C TGT TAT T GAAAT GTTCCTTGTTTT GC TGGT GAC T GGAGT AC 
AT T CAAACAAAGAAACGGCAAAGAAGAT TAAAAGGC CCAAGT T CACTGTGCC TCAGAT CAACT GC 
G AT GT CAAAGCCGGAAAGAT CAT CGAT C C TGAGT T CAT TGT GAAAT GT C CAGCAGGAT GC CAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GG T TACAAAGGGAGTTAT TCCAAC GGT G TCCAAT CG TTATCCC TACCAC GAT GGAGAGAATCC T T 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 



P ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
O CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
m GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
s a AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
*2 GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
f 1 TGGGAGATCCAAACTGCAAAATTGACT TGTCGTT TT TAAT TGATGGGAGCACCAGCATTGGCAAA 

W CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
^.g CGGT CCAC TGAT GGGT GT TGT CCAGTATGGAGACAACCCTGC TAC T CAC TT TAACCT CAAGACAC 

ACAC GAAT TC TCGAGATC TGAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGACTT TC TAAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACT T GCGAGAGAGTCAGGAAT CAACAT T T T C TTCATCACCAT TGAAGG TGCT GCT GAAAAT GAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
|t GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
FU AGAGTT TGAGAT T T CCGACAC GGACAC GCGCATC GGGGCCGTGCAGT ACACCTACGAACAGCGGC 

TGGAGT T TGGGT T CGACAAGTACAGCAGCAAGCCT GACATCC T CAAC GC CAT CAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GT CCAAGCCCAACAAGAGGAAGT TAAT GATCC TCAT CACCGACGGGAGGT CCTACGACGACGT CC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCAT T GCCAC TCACCCC GCCAGAGACCACTCC TT CT T T GT GGACGA 
GT T T GACAACC T CCATCAGTAT GT CCC CAGGATCAT CCAGAACAT T TG TACAGAGT T CAAC TCAC 
AGCCTCGGAACTGAAT TCAGAGCAGGCAGAGCACCAGCAAGTGCTGCT TTACTAACT GACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GAT CACAAACGTATAGAAT GAGCCAAAAGGC T ACATCATGT TGAGGGT GCTGGAGATT T TACAT 
T T T GACAAT T GT T T TCAAAATAAAT GT T C GGAATACAG TGCAGCCC TTACGACAGGC TTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T GTCATGACAATGTAGGAAT T GCT GAAT TAAAT GT T TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAJ^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 3d 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYAS YS S VCGAAVHS GVLDNS GGKI IiVRKVAGQSGYKGS YSNGVQS LS LPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
T PT TLPRP S P SAAS T TS I PRPQS VGHRS QEMDLWS T AT YT S S QNRPRADPG I QRQDPS GAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEK^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI L I TDGRSYDDVRI PAMAAHLKGVI T YAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features: 
Signal peptide: 



amino acids 1-26 

If! 



pi Transmembrane domain: 

p amino acids 181-200 

«i 



N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites . 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE as 

CCGAGC ACAGGAGATT GCCT GC GT TT AGGAGGT GGCT GCGT TGT GGGAAAAG CT AT C AAGGAAGAAAT TGC 
CAAACC AT GT CTT T TT TTCT GT TTTC AGAGT AGT TCAC AACAG ATCTGAGTG TTTT AATT AAGC AT GG AAT 
AC AGAAAACAAC AAAAAACT T AAG CT TT AATT TC ATCT GG AATTCCAC AGTT TT CT T AGCT C CCTGGACC C 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT C AGCCT T C CCC ACT ACAATGTG AT AG AACGCGT GAACT GG AT GT ACTT CT ATGAGT ATGAGC CG AT TT A 
C AGACAAG AC TT TC AC T T CACACTT C GAGAGC AT T C AAACTGCT CT C ATC AAAAT C C ATTTCTG GT CATT C 
TGGT GACCTCC CACCCT TCAGATGTG AAAGCC AGGCAG GCCAT T AG AGT T AC TTGGGG TGAAAAAAAGT CT 
TGGT GGGGAT AT GAGGT T CT TAC ATTTT TC TT AT T AGGCCAAGAGG CTGAAAAGGAAGACAAAATGTTGGC 
AT TGT C CT T AGAGGAT GAAC ACCTT C TT T ATGGT GAC ATAATCCGACAAGAT TTTT TAGACAC ATAT AAT A 
AC CT GACCTT GAAAAC CAT TAT GGCATT C AGG TGGGT AACTGAGTT TT GC CCC AATGC C AAGT ACGT AAT G 
AAGACAGACACT GATGTTT T CATCAATACTGGCAATTT AG TGAAGT ATCT TT T AAACCT AAACC ACTC AGA 
GAAGTT TT TCAC AG GT TAT C CT CTAATT GAT AAT TATTCC TAT AGAGGAT TT T ACC AAAAAACC CAT ATT T 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TT GGT GCC AAGG AT CT AT GAAATG AT GGGTC ACGTAAAACC CAT CAAGT T TG AAGATGTT T ATGTCGGGAT 
CT GTT TGAAT TT AT TAAAAGTG AACATT C AT ATT CCAGAAGACACAAAT CTTTTCTTT CT AT AT AG AATCC 
AT TTGG AT GT CT GT CAACT G AG AC GT GT GAT T GC AGCC CAT GG C TT TTCT TC C AAG GAGATC AT CACT TTT 
TGGCAGGTCATGCTAAGGAACACCACATGCCATTAT TAA CTTCACATTCTACAAAAAGCCTAGAAGGACAG 
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
AC ACTGAACTGAAACT C ATGAAAAAC C C AGACTGGAGACT GG AGGGTT AC AC T TGT GATT T ATT AGTC AGG 
CC CTT CAAAG AT GAT AT GT GGAGG AAT T AAAT AT AAAG GAATT GGAGGTT TT T GCT AAAGAAAT TAAT AGG 
AC C AAACAATTT GGAC AT GT CATT CT GT AG AC T AGAAT TT CTT AAAAGGGTGTT AC TG AGTT AT AAGC TC A 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CT GAAC AAAAT T TTACCT GT TT TT GGTC AT TT AT AAAGTACTTC AAGATGTT GC AGT ATT T CAC AGTT AT T 
AT TAT T TAAAAT TACT TC AACT TT GT GT T T T T AAATGT TT TGAC GATTT C AAT ACAAG AT AAAAAGGAT AG 
TGAATC AT TCTT T AC ATG C AAACATT TT CC AG TT ACT T AACTG ATC AGT T TATT AT TG AT AC AT CACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAG AATT AAAGCAAGAAAATC T GAAAA 
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FIGURE 26 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
T LREHSNC S HQNP FLVI L VT S HPS DVKARQAI RVT WGEKKS WWGYE VLT FFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



if! 
W 



CGCTCGGGCACCAGCCGCGGCAAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGTACACAGTCAT TAATGAAGCC TGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AAT ATGATC AGATTGAGT GCGTC TGCCCC GGAAAGAGGGAAGTC GTGGGTTATACCATC C C TTGC TGC AGGAATGAGGAGAA 
TGAGTGT GACTCCTGC C TGATCCAC CCAGGT TGTAC CAT CT TTGAAAAC T GCAAGAGCT GCCGAAATGGC TCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACC T GGGT TTGTC ATCC AAC TAAGAT T TGTCATGTT GAGTC TGGAGTT TGAC TACATGTGCCAGTATGACTATGTT GAG 
GTTC GTGAT GGAGAC AACCGCGATGGC C AGATC ATCAAGCGTGT CTGT GGC AACGAGCGGCCAGC TCC TATCC AGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
T ATAC TGGGCAGCGC TGTGAAAATC TC C T TGAAGAAAGAAACT GCT CAGACCC TGGGGGCCCAGTCAAT GGGTACC AGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TCT TAGT GGCAATGAGAAAAGAAC T TGC C AGC AGAATGGAGAGTGGTCAGGGAAACAGCC C AT CTGCATAAAAGCC TGCCGA 
GAAC CAAAGAT T TCAGAC C TGGTGAGAAGGAGAGT TC TTCCGAT GCAGGT TCAGTCAAGGGAGAC ACCAT TACACCAGC TAT 
AC TCAGC GGCC T TCAGCAAGCAGAAAC T GCAGAGT GC CCC TAC C AAGAAGCCAGCCC TTC CCT T TGGAGAT C TGCCCAT GGG 
ATAC CAACATC TGCATACC CAGC TC CAGT ATGAGTGC ATC TCAC CCT T C TACC GCCGCCTGGGC AGCAGC AGGAGGACATGT 
C TGAGGAC T GGGAAGTGGAGT GGGCGGGC ACC ATC C TGCATCCC TATC TGC GGGAAAATTGAGAACATC ACT GC TCCAAAGA 
CCCAAGGGT TGCGCTGGCC GT GG CAGGC AGCC ATC TACAGGAGGACCAGCGGGGTGCATGACGGC AGC C TAC ACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTC AC CATGATCAAGAC AGCAGACC TGAAAGT TGT T T TGGGGAAATT C TAC CGGGAT GAT GAC C GGGAT GAGAAGAC CATCC 
AGAGC C T ACAGAT T TC TGC TATCAT TC TGCATCCCAACTAT GAC CCC AT CC TGCT T GATGC TGACATC GCC ATCC TGAAGC T 
CCTAGACAAGGC C CGTATC AGCACC CGAGTCC AGC CCATCTGCC TCGC TGCCAGT C GGGAT CTCAGCACTTCC TTCCAGGAG 
TCCCACATC AC TGTGGCTGGC TGGAAT GTCC TGGC AGACGTGAGGAGCC C TGGCT T C AAGAAC GACACAC TGCGC TC TGGGG 
TGGTCAGTGTGGTGGACTCGC TGC TGT GT GAGGAGCAGCATGAGGAC C ATGGC AT C C CAGT GAGT GT CAC TGATAAC AT GT T 
CTG TG CCAGCTGGGAACCCAC TGCCCC T T C TGATATC TGC ACT GCAGAGACAGGAGGCAT CGCGGCTGTGTCC T TC CCGGGA 
CGAG CAT CTCC TGAGCCAC GC TGGCAT C T GATGGGAC TGGTCAGCTGGAGC TATGATAAAACAT GCAGC CAC AGGC TCT CCA 
CTGC C T TC ACC AAGGT GC TGCCT TT TAAAGAC T GGAT TGAAAGAAAT AT GAA ATGAA CCATGC TCATGCACT CC T TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGC CAGGGCTTC TGAC T T CAGGGACAAAACT C AGTGAAGGGTGAGTAGACC TCC AT TGCT GGTAGGC T GAT GCCGCGT CC A 
C TAC T AGGACAGCCAAT TGGAAGATGC C AGGGC T TGCAAGAAGTAAGT T TC TTCAAAGAAGAC CATATAC AAAAC C T CTCC A 
C TCC ACT GACC TGGTGGTC T TCC C CAAC TTTC AGT TATACGAAT GCCAT CAGC T TGACCAGGGAAGATC TGGGC T T CAT GAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
T GT G TAC AT GGC C AC AGTAC AGTC T GGT C C T T T TC C TTC CCCATC TC T T GTACACAT TT T AAT AAAAT AAGGG T T G GC T T C T 
GAAC TACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 28 

melgcwtqlglt'flqlllisslpreytvineacpgaewnimcregceydqiecvcpgkrewgyt 
i pccrneene cds cl i hpgc t i fencks crngs wggt lddfyvkg fycaecragw yggdcmrcgq 
vlrapkgq ille s yplnahcewt ihakpgfvi qlrfvml s le fdymcqydyvevrdgdnrdgq 1 1 
krvcgnerpapiqsigsslhvlfhsdgsknfdgfhaiyeeitacssspcfhdgtcvldkagsykc 
aclagytgqrcenlleerncsdpggpwgyqkitggpglingrhakigtwsffcnnsyvlsgne 
krtcqqngewsgkqpicikacrepkisdlvrrrvlpmqvqsretplhqlysaafskqklqsaptk 
kpalpfgdlpmgyqhlhtqlqyecispfyrrlgssrrtclrtgkwsgrapscipicgkienitap 
ktqglrwpwqaaiyrrtsgvhdgslhkgawflvcsgalvnertvwaahcvtdlgkvtmiktadl 
kwlgkfyrdddrdektiqslqisaiilhpnydpilldadiailklldkaristrvqpiclaasr 
dls t s fqe shi tvagwnvladvrs pg fkndtlrs gws wdsllceeqhedhgi pv5vtdnmfca 
sweptapsdictaetggiaavsfpgraspeprwhlmglvswsydktcshrlstaftkvlpfkdwi 

ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites . 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Ami da ti on site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



III 



GGTT C CTACAT CCT CT CAT CT GAGAAT CAGAGAGCATAAT CTT CTTACG GGC CC GT GAT TTATT AAC GT GGCT TAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTT GGCT CAGT C CT GCTAACTACATT GACAAT GT GG GCAAC CT GCACT TCCT GTATT CAGAACTCT GTA 
AAGGT GC CT C C CACTAC GGCCT GAC CAAAGAT AG GAAGAGGC GCT CACAAGAT GGCT GT CCAGAC GGCT GTGCGAG CC 
T CACAGC CACGGCT CC CT C CC CAGAGGTTT CT GCAGCTGC CACC AT CTC CT T AAT GACAGACGAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
T CTTT C CAAGGTTGT ACCAC CT GAT TCCAGAT GGT GAAATTACCAGCAT CAAGAT CAAT C GAGTAGAT CCCAGT GAAA 
GC CT CT CTATT AGGCTGGTGGGAGGTAG CGAAACC C CACT GGTC CATAT CATTAT CCAACACATT TAT CGT GAT GGGG 
T GAT C GC CAGAGAC G GCCG GCTACT GCCAGGAGACAT CATT CT AAAG GT CAACGG GATGGACAT CAGCAATGT CC CTC 



III ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 

y I GCAG GAACAAT GGACAGG C CCCGGAT GC CT ACAGAC C CCGAGAT GACAGCTT T CAT GT GATT CT CAACAAAAGTAGCC 

UJ CCGAGGAGCAGCT T GGAATAAAACT GGTGC GCAAG GT GGAT GAGCCT GGGGT TTT CAT CT TCAAT GT GCTGGAT GGCG 

*j" GT GT GGCAT ATCGACAT GGT CAGCTTGAGGAGAAT GACC GT GT GTTAGC CAT CAATGGACAT GAT CT T CGATAT GGCA 

s GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 

U' GGAGCCCTGACAT CTTTCAGGAAGCCGGCT GGAACAGCAATGGCAGCT GGTC CC CAGG GCCAGGGGAGAGGAG CAACA 

III CT CCCAAGCCCCT CCATCCT ACAATTACTT GT CAT GAGAAGGTGGT AAATAT CCAAAAAGAC C CCGGTGAAT CT CT C G 

M GCAT GAC CGT C GCAGGGGGAGCATCACATAGAGAAT GG GATTT GCCTAT CTAT GT CAT CAGT GTT GAG C CCGGAGGAG 

Hi 

s *tf T CAT AAGCAGAGATGGAAGAATAAAAA.CAG GT GACATTT T GTT GAAT GT GGAT GGGGT C GAACT GACAGAGGT CAGCC 

Ui GGAGT GAGGCAGT GGCATT ATT GAAAAGAACATCAT C CT C GATAGTACT CAAAGCTT TGGAAGT CAAAGAGTAT GAGC 

CCC^GGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCC^ACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GG GT CAT GT GGCT GGAATTACCACGGT GCTTGTATAACT GTAAAGATATT GTATTACGAAGAAACACAGCT GGAAGT C 
T G GGCTT CT GCAT T GTAGGAGGTTAT GAAGAATACAAT GGAAACAAACCT TTTTT CAT CAAAT C CATT GTT GAAG GAA 
CACCAGCATACAAT GAT GGAAGAATTAGAT GT GGT GATATTCT T CTT GCT GT CAAT GGT AGAAGTACAT CAGGAATGA 
TACAT GCTT GCT T GGCAAGACT GCT GAAAGAACTT AAAGGAAGAATTACT CTAACTAT T GTTT CT T GGCCTGGCACT T 
TTTTATAGARTCAATGATGGGTCAGAGGAAAACAGA^ 

TT GT CAGTT T TTATATTTAAAGAAAGAAT ACATT GTAAAAAT GT CAGGAAAAGTATGAT CAT CTAAT GAAAGC CAGTT 
ACACCTCAGAAAATATGATTCCAAAAAAATTAAAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTAC 
AACATTGTTTATATTTTTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAG CTGATTTAAATTTAAAAT TT GGTATATGCT GAAGT CTGCCAAGGGT ACATTAT GGCCATTT TT AATTTACAGCT 
AAAATATT TTTTAAAAT GCATT GCT GAGAAACGT T GCTTT CAT CAAACAAGAATAAAT ATTT TT CAGAAGTTAAA 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDE PGLDNPAYVS SAEDGQPAI S PVDSGRSNRTRARPFERST I RSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHI I IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNT PKPLHPT I TCHEKWNI QKDPGE SLGMTVAGGAS HREWDLP I YVI S VE PGGVI SRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHKMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293,. 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAA.CTTGAAGCTTT 
CTT GCCTGCAGTGAAGCAGAGAGATAGATAT TAT T C AC G T AAT AAAAAAC AT G GG C T TCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAAT GAAGCAT CCACGAAGAAGGTAGAACT T 
GACAACT GT CCT T CT GTGT CTCCTTACCTCAGAGGCCAGAGCAAGCT CATTT TCAAACCAGAT CT CAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACT GC TT TATATT CCACGATGTGGACCT GGTACCCGAGAAT GACT TTAACC T TTACAAGT GTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCT GAAGTGGGTAAATATACAATGGTCTT CCACACTAGAGACAAAGGCAAT GAGGTGAACGCAGAACG 
GAT GAAGCT CTTACACCAAGT GTCACGAGTCTGGAGAACAGATGGGT T GAGTAGT TGTTCT TATAAAT 
TAGTATCTGTGGAACACAAT CCTTTATATAT CAACAT CACAGT GGAT T T CT GGTT T GGTGCATGACC C 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGT TACAGCT CATT GT T GAGCTGAAT T TTTCCT T TTTGTATTT TCT 
TAG C AGAGC T CCT GGT GAT GT AGAGTAT AAAAC AG T T GT AACAAGACAG CT T TCT T AGT CAT T T T GAT 
CAT GAGGGT TAAATAT TGTAATAT GGATACTT GAAGGACTT TATATAAAAGGATGACTCAAAGGATAA 
AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACT GTTAT TCAT TTAT CC T GTACAATCATCT GTGAAGT GGT GGT GTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT TAAAGAGTTT TT GTAAAATGAT TTT GTACAAGTAGGAT ATGAATTAGCAGTT TACAAGTTTACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVS PYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYG I YVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFI FHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 



APPJD=1 0063534 



Page 192 



r. 

5 . S. 

■ y=i 



FIGURE 4.^1 

GC TCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATCGCACT GAGCT CCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGC^ACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTGCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAA.CLLLLLLLASLTS 
CI FCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 



m 
pi 

-VIS* 

ru 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 

G T T T C T G AAGAG AGAGAG AC AAG AAG AG T AC AT T G AAG AG AAG AAG AGAG T G G AC A T TTGTCGGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT TTAC TCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TC TAGA CAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPWELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWPCALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVS RNFSSPI LARKLCEGAADDPDS SMVLLCLLLVPLLLS L FVLGL FLWFLKRERQEE Y I E 
EKKRVDI CRETPNI CPHSGENTEYDT I PHTNRT I LKEDPANTVYSTVE I PKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features : 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 22 4-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGA.CCTGCTGCGAA.GGATGGACATCCTGCAA.TGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCT^ATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCAT TCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
M AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA 
D CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 

III ACACCATGGCGAG T GGC T GGAGAGCAT CTAGT T T CCAC T TCGAT T C TGAAGAAAACAAACATAGG 

il s 

S 5 s CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 

in C AG T CAGAT AGT C AT CGGT TT CC T TGGCTGTCTGTG T GGAGT CT C T AAGCGAAGAAGTC AAAT T G 

ly TGTAGT T TAAT GGGAATAAAATGTAAGTATCAGTAGT T T GAAAAAAAAAAA 



ru 
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FIGURE 48 

MTCCEGWT SCNGFSLLVLLLLGWLNAI PL I VSLVEEDQFSQNPI SCFEWWFPGI I GAGLMAI PA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

Q amino acids 10-31 (type II), 50-72, 87-110, 191-213 

*<M N-glycosylation sites. 

J : }f amino acids 80-84, 132-136, 148-152, 163-167 

i-l s 

III 

J* cAMP- and cGMP -dependent protein kinase phosphorylation site. 

a amino acids 223-227 

If! 

'p: N-myristoylation sites. 

PI amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

ri' 

fll Prolcaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
p| CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
O TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGGCCCT 

v 1 T T C T GATCAGGAGGCT T C TT TAT GAAT TAAACTCGCCCCACCACCCCCT CA 

til 



Si! s 
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FIGURE so 

MERVTLALLLIJVGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 
amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 

G TGGAC TC TGAGAAGCCCAGGCAGT T GAGGACAGGAGAGAGAAGGCTGCAGAC CCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC T GGTGGGAGGAAGACACT CTGGAGAGAGAGGGGGCT GGGCAGAG ATGA AG T TC CAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
O TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
Ql GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
W CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
^ J GAAACT CAGCAGGCAGC T TT GGAATGAATCC T CAGGGAGC TCCCT GGGGTCAAGGAGGCAAT GGA 

*jS GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

asps; 

s AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
O ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
ttj AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
|l TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
f|| CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CT T CAGAGGACAGGGAGT T TCCAGCAACAT GAGGGAAATAAGCAAAGAGGGCAAT CGCC TCC T TG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATT T TAAAT CCAAGC T GGGT T T CAT CAAC TGGGATGCCATAAACAAGGACCAGAGAAGCT CT C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAA 
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FIGURE 52 



MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGS WQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE sa 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGA6CCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
U TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
Q AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
.O GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
F! CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
jjjf CAGAAATGCAT C T T CAGC T T TGACAGCCAT TGT CAGGAGAGGC C CAGT GAATAT AT T GCCAC CAT 

h| CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
*B ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
V GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
PI AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
P CATT GTCAGAT GAGGATATAAGAGCAGAGGC TGACACC T T CATG TT TGGAGGCCATGACACCACG 

f|.j GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
0 ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
111 AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLLSLPWLGLRPVAMSPWLLLLLWGS 

L I TPTEEGLKDST QMSATYSQGFTVWLGP 1 1 P FI VLCHPDT IRS I TNASAAI APKDNLFI RFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAFI PFSAGPRNC I GQAFAMAEMKWLALMLLH FRFLPDHTE PRRKLEL IMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains : 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature, 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE ss 

AT CGCATCAATT GGGAGTACCAT C T T CC TCATGGGACCAGTGAAACAGC TGAAGCGAATG T TT GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
T TT GGT GGCATAACAAGGGAC T T GCAC T TATC TT C T GCAT T T T GCAGT C T T TGGCAT TGACGT GG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATA&T TCATGGCCAGTT TTATGAAGCTTTGGAAGGCAC TATGGACAGAAGCTGGTGGACAGTT T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGAT T CGAACAT T TGAGGGTTAC T T T T GGAAGCAACAATACAT TC TCGAACC T GAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT^^ 
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FIGURE s6 

MGPWQLKRMFEPTRLIATIMVLLCF 
DAVKKC FAVCLA 



Important features: 
Signal peptide: 

amino acids 1-33 

CI 

Type II fibronectin collagen-binding domain protein. 

pi amino acids 30-72 

iii 

If! 
Ill 

O 
If! 
Q 

ru 
p 
ru 
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FIGURE 52 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
G GAGATAAGGACCTT GAT G GGCAGCT AGACTT T GAAGAATT T GT C CAT TAT CT C CAAGAT CAT GAGAAGAAGCT GAGG 
CT GGT GTTTAAGATTT T GGACAAAAAGAAT GATGGAC GCAT T GAC GC GCAGGAGATCAT GCAGT CCCTGCGGGACTT G 
GGAGT CAAGAT ATCT GAACAGCAGGCAGAAAAAATT C T CAAGAGCAT GGATAAAAAC GGCACGAT GAC CATCGACTGG 
AACGAGT GGAGAGACTACCACCT CCT CCACCC CGT GGAAAAC AT C CC CGAGAT CATC C T CTACT GGAAGCAT T CCACG 
AT CTT TGAT GTGGGT GAGAAT CTAACG GT CCC GGAT GAGTT CACAGT GGAGGAGAGGCAGACGGGGAT GT GGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GC CT CC C GCAGC AACAACATGGG CAT C GTT GGTGGCT T CACT CAGATGATT CGAGAAGGAGGGGC CAGG 
T CACT CT GGCGGGG CAAT G GCAT CAAC GT CCT CAAAATT GCC CCCGAAT CAGC CATCAAATT CAT GGC CTAT GAG CAG 
AT CAAGCGC CTT GTT GGTAGT GACC AG GAGACTCT GAGGAT TCACGAGAGGCTT GTGGCAGGGT CCT T GGCAGGGGCC 
AT CGCCCAGAGCAG CAT CTACCCAATG GAGGT CCT GAAGACC C GGAT GGCGCT GCGGAAGACAGGCCAGTACTCAGGA 
AT GCT GGACT GCGCCAGGAGGAT CCT G GCCAGAGAGGGGGT GGCCGCCTT CTACAAAGG C TAT GT CCCCAACAT G CT G 
GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
C C CCT GGCC CTAGT CAGGACCCGGATG CAGGC GCAAGCCT CTAT T GAGG GCGC T CCGGAGGT GACCAT GAGCAGC CT C 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCT GT GAGCATCAG CT AC GT GGTCTAC GAGAACCTGAAGATCACCCT GG GCGT GCAGT CGCGGTGACG GGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCT GTCT CGAGC CAAG CT GT GAAAACCCTAGACGCACC CGCAGGGAG GGT GGGGAGAGCT GGCAGG CCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
AT AAT CCAT GAT GAAAGGT GAGGT CAC GT GGC CT CCCAGGCCTGACT T CC CAAC CTACAGCATT GACGCCAACTT GGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCT GGAC CCT GT CAGGAT GGGCCCCACCT CAGAACCAAACT CACT GT C CC CACTGT GGCAT GAGGGCAGT GGAGCA 
CCAT GTTT GAGGGC GAAGGGCAGAGCGT TT GT GT GTT CT GGGGAGGGAAGGAAAAGGT GT T GGAGGC CT TAATTATGG 
ACTGT TGGGAAAAG GGT TTT GT C CAGAAGGACAAGCC GGACAAAT GAGC GACTT CT GT GCTT CC AGAGGAAGACGAGG 
fil GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACC AGC CCCACATT C CACTTGT GT CACTGCTT GGAAC CTAT TTATTT TGTAT TTATT TGAACAGAGTTAT GT CCT 
AACTATTT TTATAGAT T T GT TTAATTAAT AGCTT GTCATTT T CAAGTT CATTT T TTAT T CATATTTAT GTTCATGGTT 
GATT GTACCTTCCCAAGC C CGCC CAGT GGGAT GGGAGGAG GAG GAGAAGGGGG GCCT T GGGCC GCTGCAGTCACAT CT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTT GCC C CCAGCCTTAGGATTT CAGGGTT T GACTGGGGGC GT GGAGAGAGAGGGAGGAACCT CAAT 
AACC T TGAAGGTGGAAT CCAGTTATTT CCTGCGCTGC GAG GGT T T CTT TATTT CACT CTT TT CT GAAT GT CAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTT CAAT AGGAT GCAAAGAT CAAT GCAAAAAT T GTTATATATGAACATATAAC T GGAGT CGT CAAAAAG 
CAAAT TAAGAAAGAAT T G GAC GTTAGAAGTTGTCATTTAAAGCAGCCTT C TAATAAAGTT GTT T CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE s8 



W 



MLCLCL YVPVT GEAQTE FQYFE SKGLPAELKS I FKLSVFI PSQE FS T YRQWKQKI VQAGDKDLDG 
QLD FEE FVH YLQDHEKKLRLVFKI L DKKNDGR I DAQE IMQS LRDLGVK I S E QQAEK I LKSMDKNG 
TMT I DWNEWRD YHLLHPVEN IPEII LYWKHS T I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGI VGGFTQMI REGGARS LWRGNG INVLKIAPE SAI K 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAF YKG YVPNMLG 1 1 P YAG I DLAVYET LKNAWLQHYAVNSADPGVFVLLACGTMS S T C 
GQIASYP1AXTOTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISW 
VYENLKI TLGVQSR 



IP 

III Important features: 



Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 5Q 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATGCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CAC TCAT CAT T GGCTT TGGTAT TT CAGGGAGACACT CCATCACAGT CAC TACTGTCGCCTCAGC T 
GGGAACAT T GGGGAGGAT GGAATCCTGAGC T GCAC TT T T GAACCT GACAT CAAACT TT CTGATAT 
CGT GATACAAT GGC TGAAGGAAGGT GT T TTAGGCT TGG T CCATGAGT TCAAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
H GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
if CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 

Gl 

ff$ AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 

1 1 1 CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 

111 CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

4* T CAACAACACATACT CC T GT ATGAT T GAAAATGACATT GCCAAAGCAACAGGGGAT AT CAAAGT G 

y ACAGAATCGGAGAT CAAAAGGC GGAGT CACCTACAGCT GC TAAACT CAAAGGC TTCTCTGTGTGT 

m CTCTTCTTTCTTTGGCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

|f| GTGCCT TGGCCACAAAAAAGCATGCAAAGTCAT TGT TACAACAGGGATC TACAGAACT AT T TCAC 

O CACCAGAT AT GAC CT AG T TT TATAT T T C TGGGAGGAAAT GAAT T CATAT C TAGAAGT CTGGAGT G 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 

5! CTATCT TCAAAGACATAT TAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGT TAAGA 

III 

GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAG T GAGAGGACAGGATAGTGCATGT T C T T T GTC T CT GAAT T T T TAGT T ATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T CT TCCCAAC T GACAAATGCCAAAGTT GAGAAAAAT GAT CATAAT T T TAGCATAAACAGAGCAGT 
CGGGGACACCGATT TTATAAATAAACTGAGCACCT TC TT T TTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVIsr^YNASSETLRCEAPRWFPQPTVWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYOTTIIMTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALLPLS PYLMLK 

3»ls Important features: 

O Signal peptide: 

:f amino acids 1-28 

it 1 

HI Transmembrane domain: 

yj amino acids 258-281 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGT CAGAAT CACC ATGG CCAGCTAT CC T T AC C GGCAGGGCTGC CCAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGAC TCAGATCACAGTGGCTATATC TCCATGAAGGAGC TAAAGCAGGCCC T GG T CAAC TGCA 
AT TGGTC TTCATTCAATGATGAGACCTGCCTCATGATGATAAACAT GTT TGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGAGTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
m ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
f|| AATAGT GAGGACCGGGGC TGAGGC CACACAGATAGGGGCC TGAT GGAGGAGAGGATAGAAGTT GA 

C3 AT G T CCTGATGGCCATGAGCAGT T GAG T GGCACAGCCT GGCACCAGGAGCAGGTCCT TGTAAT GG 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T T GGGGCCAAAAGTCCAGTGAAAT TGTAAGCTTCAATAAAAGGATGAAACTCT GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCI^MIIMFDKTKSGRIDWGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLSFEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6* 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TG'CCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
AT C TGGGGTAT CAATAATCAACCCCCAGG TACCAGCT GGGGAAAT AT TAATCGG TAT CCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGC TAGAGCAC GATAGAGGGAAAC CCAACAT TGGGAG T TAGAGT C C T GC TCCCGCCCC T TGCT G 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
HI AAAAAAAAAAAAAAAAAAAAAAAA&l^^ 



TV : 
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FIGURE 6a 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDIARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPJ^DSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



4> Important features of the protein: 

*f Signal peptide: 

it amino acids 1-2 6 

II Casein kinase II phosphorylation sites. 

II amino acids 56-59, 155-158 



N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 



ri 

if! 

fit 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



m 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGGGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTAT TCACT TT TATATAT T TATATAAAAT TAG 



h| TAG T G AGAT G T A?yy\AAAAAAAAAAAAAA 



?ll 
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FIGURE 68 



MANPGLGLLLALGLP FLLARWGRAWGQI QTT SANENS T VLPS S TS S S SDGNLRPEAI TAI I WFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 8 6-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GC CAGGAATAACTAGAGAGGAACA ATG GGGTT ATT CAGAGGT T T T GTT TTCCTCTTAGTT CTGT GCCT GC TGCAC CAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAAT AGAGGATAT GGT GACT ACAGCT T CTACGTACCT GTTT GAAGCCACAGAAAAAAGA 
TTTTTTT TCAAAAAT GTAT CTATATTAAT T CCTGAGAATT GGAAGGAAAATCCT CAGTACAAAAGGC CAAAACAT GAA 
AACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCATACACCAAGCAGTTCACA 
GAAT GT GGAGAGAAAGGCGAATACATT CACT T CACCC CTGACCTT CT ACT TGGAAAAAAACAAAATGAATAT GGACCA 
CCAGGCAAACT GTTT GT CCAT GAGTGGGCT CACCT CC GGT G GGGAGT GTT T GAT GAGTACAAT GAAGAT CAGC CTTT C 
T ACC GTGCTAAGT CAAAAAAAATCGAAGCAACAAGGT GT T CC GCAGGTAT CT CT GGTAGAAATAGAGT T TATAAGT GT 
CAAGGAGGCAGCTGT CTTAGTAGAGCATGCAGAATTGAT T CTACAACAAAACT GTAT GGAAAAGATT GT CAATT CTT T 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATGCAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAA 
8 : AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAAT 
T CTGAGGATT T TAAAAACACCATAC C CAT GGT GACACCAC CT CCT C CACCT GTCT T CTCATT GCT GAAGAT CAGT CAA 
Q AGAATT GTGT GCTT AGTTCT T GATAAGTCT GGAAGCATGGGGGGTAAGGAC C GC C TAAAT C GAAT GAAT CAAGCAGCA 

fe\ AAACATTTCCT GCT G CAGAC T GTT GAAAAT GGAT CCT GGGT GGGGAT GGT T CACT TT GATAGTAC TGC CACTATT GTA 

Izt AATAAGCTAAT CCAAATAAAAAGCAGT GAT GAAAGAAACACACT CAT GGCAGGATT ACC TACATATCCT CT GGGAGGA 

pi ACTT CCATCT GCTCT GGAAT TAAAT AT GCAT T TCAGGT GATT GGAGAGCTACATT C C CAACT C GAT GGAT CC GAAGTA 

$„ | CT GC T GCTGACT GAT GGGGAGGATAACACTGCAAGT T CTT GTATT GATGAAGT GAAACAAAGT GGGGCCATT GT T CAT 

J^f TTTATTGCTTT GGGAAGAGCT GCT GAT GAAGCAGTAAT AGAGAT GAGCAAGATAAC AGGAGGAAGTCAT TT T TAT GT T 

f|| TCAGAT GAAGCT CAGAACAAT GGC CT CATT GAT GCT TTT GGG GCT CTTACATCAGGAAATACT GAT CT CT C CCAGAAG 

3 j T CCCTTCAGCT CGAAAGT AAGGGAT T AACAC T GAATAGTAAT GC CT GGAT GAAC GACACT GT CATAATT GATAGTACA 

GT GGGAAAGGACACGTT CTTT CTCATCACAT GGAACAGT CT GCCT CC CAGTATTT CT CT CT GGGATCCCAGT G GAACA 
*P ATAATGGAAAATTTCACAGT GGAT GCAACTT C CAAAAT GGCCTAT CT CAGTATT CCAGGAACTGCAAAG GT GGGCACT 

TGGGCATACAAT CTT CAAGCCAAAGC GAAC C CAGAAACAT TAAC TATTACAGTAACT T CT CGAGCAGCAAAT TCT T CT 
5 r GT GC CTCCAAT CACAGT GAAT GCTAAAAT GAATAAGGACGT AAACAGTTT CC CCAGCCCAAT GAT TGTTTACGCAGAA 

llf ATT CTACAAGGATAT GTACCT GTT CTT GGAGC CAAT GTGACT GC T TT CAT T GAAT C ACAGAATGGACATACAGAAGT T 

j's^j TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
ACAGAAAAT GGCAGATAT AGCT TAAAAGT T CG GGCT CATGGAGGAGCAAACACT GC CAGGCTAAAAT TAC GGCCT CCA 
Q CT GAATAGAGC CGCGTACATACCAGGCT GGGTAGT GAACG GGGAAATTGAAG CAAACCC GC CAAGACCT GAAAT T GAT 

fU GAGGAT ACT CAGAC CAC CTT GGAGGATTT CAGCCGAACAGCATC CGGAGGT G CAT TT GT GGT ATCACAAGT C CCAAGC 

; j;f CTTCCCTTGCCT GACCAATAC C CAC CAAGT CAAAT CACAGAC CTT GAT GCCACAGTT CAT GAGGATAAGATT ATT CT T 

IJJ ACAT GGACAGCACCAGGAGATAAT TT T GATGTTGGAAAAGTT CAACGTTATATCATAAGAATAAGT GCAAGTATT CT T 

jfj | GAT CTAAGAGACAGT TT TGAT GAT GCT CTT CAAGTAAAT ACTACT GATCT GT CACCAAAGGAGGCCAACTC CAAG GAA 

5 ^ AGCT TT GCATTT AAACCAGAAAATAT CT CAGAAGAAAAT GCAACC CACATATTTATT GC CATTAAAAGTATAGATAAA 

AGCAAT TT GACATCAAAAGTAT CCAA.CAT T GCACAAGTAACT TT GT TTATCC CT CAAGCAAATC CT GAT GACATT GAT 
CCTACAC CTACT CCTACTC CTACT C CTACT CCTGATAAAAGT CAT AATT CT GGAGTTAATAT TT CTACGCT GGT ATT G 
T CT GT GATT GGGTCT GTTGTAATTGTTAACT TTAT TTTAAGTAC CACCATT TGAA C CTT AACGAAGAAAAAAAT CTT C 
AAGT AGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT AAAGGATATTT CT GAAT CTTAAAATT CAT CC CAT 
GT GT GAT CATAAACT CAT AAAAATAATTT TAAGAT GT CGGAAAAGGAT ACT TT GATT AAATAAAAACACTCATGGATA 
TGTAAAAACT GT CAAGATTAAAAT TTAATAGT TTCATTTAT TT GTTATTTTAT TT GTAAGAAATAGTGAT GAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTATCTGAGTAGT CAAAATACAAGTAAAGGAGAGCAAATAAACAACAT TT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDSKIIEQIEDMVTTASTYLFE 
ATEKRFFFKOTSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
I HFT PDLLLGKKQNE YGPPGKLFVHEWAHLRWGVFDE YNEDQP FYRAKSKK I EATRCSAG I S GRN 
RVYKCQGGSCLSRACRI DSTTKLYGKDCQFFPDKVQTEKAS IMFMQS IDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 

H YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 

!j? ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 

T, | AANS S VP P I TVNAKMNKDVNS FPS PMIVYAE I LQG YVP VLGANVT AFI E S QNGHTE VLE LLDNGA 

HI GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANT^^ 

III RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRY I IRI SAS I LDLRDS FDDALQVNTTDLS PKEANSKE S FAFKPEN I SEENATHI FI AI 

s 

p.. KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
111 VNFILSTTI 

III Signal peptide: 

I:: amino acids 1-21 

HI 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 47 6-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGT GGAAACCCT GGGAGTAGAGTACT GACAGCAAAGAC CGG GAAAGACCATAC GT C CC CGGGCAG GGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GATGTAG CCACCT CCTAAC CTT C CCTT CTT GAAC C CC CAGTTAT GCCAGGATTTACTAGAGAGT GTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACTT GC CTGAGGGTGGACCAGAAGAAAGGAAAGGTC CCCT C TTGCT GTT GGCTGCACAT CAG GAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAG CT GC T CT GT GTGGT GGTTAACT C CAAGAGGCAGAACT CGT T CT AGAAGGAAAT G GAT GCAAGCAG CT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACT GCCCAGGGC CAACAGCC CCAC GGGGAAGGAGGGGTAC CAGGCCGT CCTT CAGGAGTG GGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
CCTT T CGAT AGCT T TACT CTACAGAAGGTGTAC CAGCT GGAGACT GGCCT TAC C CGC CACCCCGAGGAGAAGC CT GT G 
AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
CCCAATCAC C GT CC TT ACACGGCCT CT GAT TT CATAGAAGGGAT CTACC GAACAGAAAGGGACAAAGGGACATT GTAT 
GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
GT GAAAAAT GAAAAGCT CAACATGGCCAACACG CTTAT CAAT GTTAT CGT GCCT CTAGCAAAAAGGGT G GACAAGTT C 
CGGCAGT T CATGCAGAAT TT CAGGGAGATGTGCATT GAGCAGGAT GGGAGAGTC CAT CT CACT GTTGTT TACTT T GGG 
AAAGAAGAAATAAAT GAAGTCAAAGGAAT ACT T GAAAACACT T CCAAAGCTGC CAACT TCAGGAACTTTACCTTCAT C 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GTAT TTTAT C CAGT T CTT TT CAGT CAGTACAAT C CTG GCATAATATACG GCCAC CAT GAT GCAGT CCCT CCCT T G GAA 
CAGCAGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGAC 
TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
CACAGCAACCTCATAGT GGTACGGACGCCT GT GC GAG GACTCTT CCACCT CT GGCAT GAGAAGCG CT GCATGGACGAG 
CT GACCC CC GAGCAGTACAAGAT GT GCAT GCAGT CCAAGGCCAT GAACGAGGCAT CCCACGGC CAGCT G GGCAT GCT G 
GT GT T CAGG CAC GAGATAGAGGCTC^CCTT CGCAAACAGAAACAGAAGACAAGTAGCAAAAAAAC ATGAA CT C CCAGA 
GAAGGAT T GT GGGAGACACT TT TT CTT T CCTTTT GCAAT TACTGAAAGT GGCT GCAACAGAGAAAAGAC T TCCATAAA 
GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
AATCAAAAT CTCCGCTTTGCCT GCAAAAGTAAC C CAGTT GCACC CTGT GAAGT GT CT GACAAAGG CAGAATGCT T GT G 
AGAT TATAAG CCTAATGGT GT GGAGGT TTT GAT GGTGTT TACAATACACT GAGACCT GT T GT TTT GT GT GCT CATT GA 
AATAT TCAT GAT TTAAGAGCAGT TTTGTAAAAAAT T CAT TAGCAT GAAAG GCAAGC ATATTT CT CCT CATAT GAATGA 
GCCTATC^GC^GGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGAC CAGAAAAGAAAAGAAACCATAAATAT CGT GTCAT ATT TT C CCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
ATTT TTATAT TTTTTAAGAAGATACTTTGAGATGCATTAT GAGAACTTTCAGT T CAAAGCAT CAAAT T GATGC CATAT 
C CAAGGACAT GCCAAAT GC T GAT T CTGTCAGGCACT GAAT GTCAGGCATT GAGACATAGGGAAGGAAT G GTT T GT ACT 
AATACAGAC GTACAGATACTTT CTCT GAAGAGTATTTT C GAAGAGGAGCAACT GAACACTGGAGGAAAAGAAAAT GAC 
ACTTT CT GCTTT ACAGAAAAGGAAACTCAT T CAGACT GGT GATAT CGTGATGTAC CTAAAAGT CAGAAAC CACATTT T 
CT CCT CAGAAGT AG GGAC C GCTTT CTTACCT GT TTAAATAAAC CAAAGTATACC GT GT GAACCAAACAAT CT CT TTT C 
AAAACAG GGT GCTC C TC CT GGCTT CTGGCT TC CATAAGAAGAAAT GGAGAAAAATATATATATATATAT AT ATATT GT 
GAAAGAT CAATCCAT CT GC CAGAAT CT AGT GGGAT GGAAGTTTTT GCTACAT GTTAT CCACC C CAGGC CAGGT GGAAG 
T AAC T GAAT TAT T TT T T AAAT TAAGC AGT T CTACT CAAT CAC C AAGAT GCT T C T GAAAAT T GCAT T TT AT TAC CAT T T 
C^\AACT ATTT TT TAAAAATAAATACAGTTAACATAGAGT GGTTT CTT CAT TCAT GTGAAAATTATTAGCCAGCAC CAG 
AT GCATGAGCTAATTAT CT CTT T GAGT CCTT G CT T CT GT TT GCT CACAGTAAACT CATT GTTTAAAAGCTT CAAGAAC 
AT TCAAG CT GTT GGT GT GTTAAAAAAT GCATT GT ATT GATT T GTACTGGTAGTTTAT GAAAT TTAATTAAAACACAGG 
CC AT GAATGGAAGGTGGTAT TGCACAGCTAATAAAATAT GATTT GT GGATAT GAA 
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FIGURE 72 

MMMVRRGLLAWISRVWLLV 

E QHRNYVS S LKRQ I AQLKEE LQERS EQLRNGQ YQAS DAAGLGLDRS PPEKTQADLIAFLHS QVDK 
MWAGVKLATEYAAVPFDSFTLQKYYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TLIWIVPIJ^RVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVWTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 



APP ID=1 0063534 



Page 222 



FIGURE 73 

GAGACTGC^GAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAA.CCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGC AGAGATAAATAC ACT CACGCC AGGAGC T CGC TCGC TCTCTCTCTCTCTCTCT CAC T CCT C 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GCCCAGT CGCCCAT CGATATTCAGACAGACAGT GT GACAT T TGACCC TGAT T TGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACGCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCAC CT GCAC TGGGGT CAGAAAGGAT C CCCAGGGGGGT CAGAACACCAGATCAACAG 
TGAAGCCACATT TGCAGAGC TCCACATTGTACAT TATGACTC TGAT TCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCTTATGAACACAT TCTGAGTCACT T GCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T T CAAT GGAAC AG C T GGAAAAG C T T CAGG G GACAT T G T T C T C CACAGAAGAGGAG C C C T C T AA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAG T GT GGTC T TCACCT CAGCACAAGCCACGACTGAGGC ATAAA T T CCT T CT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACAC TGTAGGAGTAGTAAGCAGATGTCCTCCT TCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAAC TGCAGAGCCT TCAGCCTCTCCAAACAT GTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic -type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7S 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGA ATG TCCTACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGT TAAAGT TACTTACAC TGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
1"* ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

o 

fn CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
[y TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
U1 GAT TTTGAT T TATGGAAAT GAAT TT GACAAAAGAT TCT T T GT GCC T GCT GAAAAAATCGT GAT TA 

W ACT TTAT CACCC TCAATAT C T CGGATGAT TC TAAAATT TC TCATCAGGATAT GAGT T TACT GGGA 

Sp; AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
fn GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
lf| AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
HI ACAGT CAT T GAATATGAATAT GATG T CAGAACCAC TGACATT TG TGCGGGGCC T GAAGAGCAGGA 

^ GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 

SI TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

1 1| 

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCAT CGACGACCC T GGT CGAC T GGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACC TATCT CAT GCAAT TCATGGAGGAAT GGGGGT T ATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISYVLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SEFKAKII FWYVLPI S I TVFLFSVMGYS IYRY IHVGKE KHPANL I L I YGNE FDKRF FVPAEK 
I VINF I TLN I S DDSKI SHQDMS LLGKS S DVS S LNDPQPS GNLRP PQEEEE VKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

*"f Important features: 

yl 

1% Signal peptide: 

fjl amino acids 1-28 

4 !B Transmembrane domain: 

'L, amino acids 140-163 

Q 

P| N-glycosylation sites. 

HI amino acids 71-74, 80-83, 89-92, 204-207, 423-426 

%*$ 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC T GAAGGAC CACAACGCCACCAGCATC C T GCAGCAGCT GCCGC TGC T CAGTGCCAT GC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T GATCAATGAAAAGGCT GCAGATAAGCT GGGAT C T ACCCAGAT CG T GAAGATCCT AACT CAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T TACACCAAAGGT GACCAAC TTATAC TCAAC TT GAATAACATCAGC TC TGAT CGGATC CAGC TG 
ATGAACTCTGGGATTGGCTGGTTGCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAI^REK 
PAGGIPVLGSLVNTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMY7VDLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KW FNN S AAS L TMP T L DN I P FS L I VS QD WKAAVAAVL S P E E FMVL L DS VL P E S AHRLKS S I GL IN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
' KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 

iPI 

5f FE AAE S S L TKDAL VL T PAS L WKP S S P VS Q 

iJ 

fij Important features of the protein: 

yl Signal peptide: 

lit 

amino acids 1-21 

f-| N-glycosylation sites. 

Lfl amino acids 48-51, 264-267, 401-404 

M 

yf Glycosaminoglycan attachment site. 

CI 

» 3 amino acids -412-415 

ill 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCCT GGCAGAGAGACT C TGAAAT GAGGGAT TAGAGGT G T TCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GAC TAAT T T GT GCAT GAACT GAAATAAAACCAT C C TACGGT ATCC AGGGAAC AGAAAGCAGGATG 
C AGGAT GGGAGGACAG GAAG GC AG C C T GGGACAT T TAAAAAAAT A 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVi\MLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLGLPADI QAAQAMMVTS SAI S SLAC 1 1 SWGMRCTVFCQESRAKDRVAVAGGVFFI LGGL 
LGFI PVAWNLHG I LRD FYS PLVPDSMKFE I GEAL YLG 1 1 S SL FS L I AG 1 1 LC FS CS S QRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

FMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTT.CCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
y, CCAT GGGAAGAGATACT CCCCCGGCGAGAGC T GGCACC CCTACT TGGAGCCACAAGGCC T GAT GT 

C3 ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
f J TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
|f| AGATCTTCAGTGCGCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

||| ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

m 

*r* AC TGC CAGAC TCCT GC T GCCAAGC C T GC AAAGAT GAGGCAAGTGAGCAATCGGAT GAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
r2 AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
n CAGACC CAAGGGAGCAGGCAGCACAAC T G T CAAGAT CGTCC T GAAGGAGAAAC ATAAGAAAGC CT 

f|| GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
iy CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C TCGGACT TGGTGGAGAT CT ACCTCTGGAAGCTGGTAAAAGATGAGGAAAC TGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGAC TT CCAGAAAGAGGCACAGCAC T TCCGAC TGCT CGC TGGC CCCCACGAAGGT CACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAACAGTTGCAGATATGAGC TGTATAAT TGT TGT TAT TATATAT TAAT AAA 
TAAGAAGT TGCAT T AC CC T CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
Uk SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

s 

Zi: Signal peptide: 

M amino acids 1-25 

il 

i 3 1 
J:: 



ry 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTT C CATGCCGT GAGGTCCATT C AC AGAAC ACAT CC ATGGCT CT C AT GCT CAGT TT GGTT CT GAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
. ; AAGGCAGG AC AAAACT GGTG AAGGAT TC T ATT GCGGAGGGGC GCAT CT CT CT GAGGCT GG AAAACATT ACT 

GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
Pi ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
|f| TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
|y AC AGACTC CAGGAC AAAC AGAG AC AT GC ATGGCCT GTT TGAT GT GGAGATCT CT CTGACCGTCC AAGAGAA 

HI CGCC GGGAGC AT AT CCTGT T CC AT GCGGC AT GCT CAT CTGAGC CGAGAGGTGGAATCC AGGGT ACAGAT AG 

W GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 



1*1 
Ik s 



T T TTTT GGC AT TGT TGGACT GAAG AT TT TCTT CTCC AAAT T CC AGT GG AAAATCCAGGCGGAACTGGACT G 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CC TC ACTCTG AGAAGAGATT T AC AAGGAAGAGTGTGGT GGCTT C TC AGAGT T TCCAAGC AGGGAAACATT A 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT AC GT GAC T TTGTCT CC CG ATCATGGGTACTGGGT CCT C AGAC T GAATGG AGAAC AT TT GTATTT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CC AGTCAC CC AGGAAT CAGAGAAAGAGG CCT C TT GGC AAAGGGC CT CT GC AATCCC AG AGAC AAGCAAC AG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACC GT CAGGAAT T CCCATCTC AC AG GCTGTGGT GT AGAT T AAGTAGAC AAGGAAT GT GAAT AAT GCT TAG 
AT CT T ATT GATGAC AGAGTGT AT CCT AATGGT TT GT T C AT T AT ATT AC ACT T TC AGT AAAAAAA 
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FIGURE 84 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
L YRDGKDQP FMQMPQYQGRTKLVKDS I AEGR I SLRLEN I T VLDAGLYGCR I S SQS YYQKAI WE LQ 
VSALGS VPL I S I T GYVDRDI QLLCQS SGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGL FDVE I SL 
T VQENAGS I S CSMRHAHLSRE VE S RVQ I GDT FFE P I S WHLATKVLG I LCCGL FFG I VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKE YVTL S PDHGYWVLRLNGEHLY FT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGGTGCTGCCCCT 
GCTCT GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACT GC TGACGAT GCAGAG T T CCGT GA 
CGGTGCAGGT^GGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACC CACATACCAAGAAT T GCACCC TGAGCAT CAGAGAT GCCAGAAGAAGTGAT GCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAT^ATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAA.CCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCC TTGGGACT CGCGGGGACAGGAGGCCACTGACAC C GAGTAC TCGGAGAT CAAGAT CCACAG 
ATGAGAAAC TGCAGAGACTCACCCTGATTGAGGGATCACAGCCCC TCCAGGCAAGGGAGAAGTCA 
GAGGCTGAT TCTT GTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAAT TATG 
TGCAGAGTGAAAAGCACACAGGC T T TAGAGT CAAAGTAT CT CAAACCT GAAT C CACACT G T GCCC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTL I PQPQDHGT S LT CQVT FPGAS VT TNKT VHLNVS YPPQNL TMT VFQGDGT VS T VL GNGS SL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
T CATAGCGACCACCAGAGGAT GGAGTACAGATGAGGC TAAT ACT TAC TT CAAGGAATGGACCT GT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
fa GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
Q CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
,"\ AGAAACAGC TCCC T GCT GAGGTACCGCACGGACAC TGGC TT C CTC CAGACACTGGGACAT AAT CT 

|?| G T T TGGCAT C TAC CAGAAATATCCAGTGAAATAT GGAGAAGGAAAGTGT TGGACTGACAACGGCC 

L|| CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
4* GGCC AGC G GGAAT T CAC T G C G GGAT T T G T T C AG T T CAG G GT AT T T AATAAC GAGAGAGCAGC CAA 

J CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

•3 GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

1.1 1 

P| GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 

ft! T TGAG AGT TT TGT GGGAGGGAACCCAGACC T CTCC TC CCAACCATGAGAT CC CAAGGATGGAGAA 

P C AAC T TAC C CAG T AGC TAG AAT G T TAAT GGC AG AAG AG AAAAC AAT AAAT CAT AT T G AC T C AAGA 

iy AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCW T DNG PVI PWYD FGDAQKTAS Y YS P YGQRE FT AG FVQ FRVFNNE RAANAL CAGMRVT GCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 



Important features : 

a 



v-l Signal peptide: 



amino acids 1-16 

a.s..,s 

m N-glycosylation site. 

Ill amino acids 163-167 

Glycosaminoglycan attachment sites. 

J*~ amino acids 74-78, 289-293 

y i 

f|| N-myristoylation sites. 

O amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCGCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACT TATATACAGTT TTAGGGGACAATTAAAAAAAAAAAA 
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FIGURE QO 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

Q 
Q 

m 
iii 
in . 
w 

Q 

if! 
o 
ru 
o 
ru 
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FIGURE Ql 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
O CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
Iff CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
hi TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
4« CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

! TGGAAGGACAGG C G T G C C CAT GCAGGAGAC CATC T G GACAC C GGG GAG GGAAGG GGTTGGGCCTC 

ill 

a» AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

yi 

AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
f|| AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
P TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAG TGGCAACCC TGGGGT C T T T GAT G TC T T GACAGAT TGACCAT CT GT CT CCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 



III 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKI C S RDHAQS SAT WS CSQP FKWC VY I AFYS TDYRLVQKVCPDYNYHS DT P Y YPS G 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGAT GGACCAACACAGAAATATCTGC T GAT CT T T GGAGCGT TTGTCTCTGTC T AT AT CCAAGAA 
ATGTT CCGAT TT GCAT AT TATAAAG T C T TAAAAAAAGCCAGT GAAGGT T TGAAGAG TATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCC T GAAACT C T GCCTGC T CTGCCAAGACAAGAACT TTCTTCTT TACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
T T T TC T GAAAATCCCT T T TT C TGGTGGAAT TGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Qd 

MTAAVFFGCAFI AFGPALAL YVFT I AI E PLR 1 1 FL I AGAFFWLVS LL I S S L VW FMARVI I DNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFV1SFTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINIASAFIILVLMGTWAFIAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Qg 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC TACCAAAC CAACAGCAG TCAAATCAGGT CTTTCCTT C TT TAAG TC TGAT ACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
C CAGACCCACCCATTGACCCT GGGAGGGT T GAAT G TACAACAGCAACTGCACCCACATG T GT TAC 
|W= CAAT T T T TGTCACACAAC T T GGAGCCCAGGGCACTAT C CT AAGCT CAGAGGAATT GCCACAAAT C 

CI TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
S TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
Z\ GAACCC CAGCAGGCCGCCTCCCAACTC CCAG T GGCACAGAT GACGAC T T TGCAGT GACCACCCCT 

||| GCAGGCATCCAAAGGAGCACACAT GCCATCGAGGAAGCCACCACAGAAT CAGCAAAT GGAATT CA 

Uj G TAA.G CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
4* AT TGAT TAT AT TAT GGAATAGAT T GAGACACAT T GGATAGTC T TAGAAGAAAT TAAT TCT T AAT T 

% % TACCTGAAAATAT TCT TGAAATTTCAGAAAATATGTT CTAT GTAGAGAATCCCAACT TT TAAAAA 

IK CAATAAT TCAAT GGATAAAT CTGTCTTT GAAATATAACATT AT GC TGCC T GGATGATAT GCATAT 

Q TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
f|| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

o 
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FIGURE Q6 

MRS T I LLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSL I PLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DDDFAVT T PAG I QRS THAI EEAT TESANG I Q 

Signal peptide: 

amino acids 1-16 



ill 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACC ATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
. AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
W ACCAGCCT TGACCAC GCCC TGGAGAGGACC TAC CAGCT GT T GGTACAGG TCAAGGACAT GGGT GA 

U CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
|f| CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
Tfi CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
J*! TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
^| AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
UJ ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
j; CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
s ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
HI TC GAAGTCGCAGT CACAGAT ATCAATGATCACGCC CCT GAGT T CAT CAC T TCCCAGAT T GGGCC T 

Q ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
fU CC TCGAGCCCGCCT TCCGCCT CAT GGAT TTTGCCAT TGAGAGGGGAGACACAGAAGGGAC T T T T G 

l?- GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
y GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
III AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TC T GCCC T GGGGT GGAGGCACCATCAC CATCACCAGGCATGT C TGCAGAGCC TGGACACCAACT T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPG I P FL FLEAS DRDE PGTANSDLRFH I LS QAPAQP S PDMFQLE PRLGALALS PKG 
S T S LDHALE RT YQLLVQVKDMGDQAS GHQATAT VE VS 1 1 ES TWS LE P I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDP TSGSVTLGVLPLRAGQN ILLLVLAMDLAGAEGGFS S TCEVEVAVTD I NDHAPE FI TSQ I G 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPI SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 



||j Signal peptide: 

amino acids 1-18 



Transmembrane domain: 

amino acids 7 62-784 
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FIGURE QQ 



GGCTGACQGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTC CAAGAT T CT TCC CAGGAACACAAACGTAGGAGACCCACGCTCC TGGAAGCACCAGCCT T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT C TAGGACCCAGGCAT C T T GCT T TCCAGCCACAAAGAGACAG ATG AAGAT GCAGAAAGGAAATG 
TTC TCCTTATGT T TGGTCTACTAT TGCATT TAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAG TCAC CAAC T C TGAG T TCCATACAACC TC CAGT GGGATCAGCACAGCCACCAACT CT GAG 
T TCAGCACAGCG T C CAGTGGGATCAGCATAGCCACCAACTC TGAGT CCAGCACAACC T CCAGT GG 
N s GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
p| CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
pi AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
jS CAAC TC TGAC TC CAGCACAACC TCCAGT GGGGCTAGCACAGCCACCAAC TC T GAG T C CAGCACAA 

CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
W GCCACCAAC T CT GAGT CCAGCACAACC T CCAGTGGGGC CAGCACAGCCACCAACT CT GAGTCCAG 

If! AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
1 5 5 GCACAGCCACCAACT CT GAC TCCAGCACAGT GTCCAG T GGGGCCAGCAC T GCCACCAACT CT GAG 

P TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTC TGAC TCCAGCACAACCTCCAGTGGGGCCGGCACAGCCAC CAAC T 
3 CT GAGT CCAGCACAGTGTCCAGT GGGATCAGCACAGT CACCAAT TC TGAGTCCAGCACACCCT CC 

D AGT GGGGCCAACACAGCCAC CAAC TCT GAGT CCAGTAC GACC TCCAGT GGGGCCAACACAGCCAC 

|jf CAACT C TGAG TCCAGCACAGT GTCCAGTGGGGCCAGCACT GCCACCAAC TCTGAGT C CAGCACAA 

pi CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
Sf! GCCAC CAAC TCTGACT CCAGCACAACC TCCAGTGAGGCCAGCACAGCCAC CAACT CT GAGT C TAG 

W CACAGTGTC CAG T GGGAT CAGCACAGT CACCAAT T CT GAGT CCAGCACAACC T CCAGTGGGGC CA 

Q ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
fU ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGTCCTAACT GG T T CT GGAGGAGACCAGTAT CAT CGATAGCCATGGAGAT GAGCGGGAGGA 
ACAGCGGGCC CTGAG CAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCT GG T T TCCT T TCAT T CATC CCAGGAGACCC CT CCCAGCT T T GT T TGAGAT CC T GAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
TGCTCATTTAGCTAAGAAATAAATACATCTCATCTAACACACACGACAAAGAGAAGCTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE lOO 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGAJSTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGAS TATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
H TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 

W S T T S S GANTATNS G S S VT SAG S G TAAL T GMH T T S HS AS T AVSEAKPGG S LVPWE I FL I T LVSWA 

PI 

I:: AVGLFAGLFFCVRNSLSLRNT FNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 

m 

l A AMEMSGRNSGP 

u] 

ill Signal peptide: 

"T" amino acids 1-20 



Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGAT GTGATATATGCAT TCCAGGGGAAGGGAAAT T GTGGTGC T TC T GAAC CCAT GGT CAAT T 
AACGAGGCAGT T T C TAGC TACTGCACGTAC TTCAT AAAGCAGGAC T C TAAAAGCT T T GGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
H TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
^ CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

III GAT TATAAC TGGGGAT GCAT TTGTTCCT GGAGAAAGAAGT G T CAT TATCATGAACCAT CGGACAA 

IM 

§ , I GAATGGAC TGGATGTT CCT GT GGAATT GCCTGAT GCGATATAGC TACC TCAGAT TGGAGAAAAT T 

111 TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
W CT TCAT TCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTT TTGTG 

ATAT TCACGAACCACT T CAAC TCC TCATAT TCCCAGAAGGGAC TGATC TCACAGAAAACAGCAAG 
TC TCGAAGTAAT GCAT T TGCT GAAAAAAATGGACT T CAGAAAT AT GAATATG T T T TACAT CCAAG 
pi AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
O ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
rti CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
*J* CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT TT T TAT TT TACCGGACAGAGT GT CAT T CCACCT TGCAAGTCTGAAC TCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGC7\AGAGA 
GAATATT TGG T GGAC T GGAGATC ATAGAAC T T GCAT GT T AC C GAC T T T T ACACAAAC AGCC AC AT 
T TAAAT TCAAAGAAAAAT GAGTAAGAT TATAAGGT TT GCCAT GTGAAAACC TAGAGCATAT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCAT CAT TAT TT GT TAAAGATAT T T T GCAC T T AATT TT GT GGGAAAAATAT TGCTACAAT T TT T 
T TTAATC TC T GAATGTAATT T CGATAC TGTGTACATAGCAGGGAG T GAT CGGGGT GAAATAACT T 
GGGCCAGAATAT TAT TAAACAAT CATCAGGC T T T TAAA 
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FIGURE 102 

MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKS FGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKYIITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI P PCKSELRVLWKLLS ILYWTLFSPAMCLLI YLYSLVKWYFI I T I VI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCC ATG GCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACC T C TACAGAGAT GGGGAAGAC TGGGAATC T AAGCAGAT GCCACAGTATCGAGGGAGAACT GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATC GGCCT GTAT GG GT GC T GGT T CAGT TC C CAGATT TAC GATGAGGAGGCCACCT GGGAGC T G 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
O AGGAT T T GTC T T CAGACT CCAGAGCAAAT GCAGAT GGG TACAGCC TGTATGATG TGGAGATCTCC 

p ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
ffi GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
™l TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
j*f AAATCCAAAGGGAAAATCCAGGCGGAAC TGGACT GGAGAAGAAAGCACGGACAGGCAGAAT T GAG 

111 AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAA.GCTCTGCG 
hi T T TC TGATC TGAAAACT GTAACCCATAGAAAAGC TCCCCAGGAGGT GCC T CAC TCT GAGAAGAGA 

*g TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
Q ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
Iff GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
pi TGACATGTCAG TT T GAAGGCT TGTT GAGACCCTATATCCAGCAT GC GAT G TATGACGAGGAAAAG 

Sj* GGGAC TCCCAT AT T CAT AT GT CCAG TGTCCTGGGGATGAGACAGAGAAGACCC T GC T TAAAGGGC 

{%* CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
CI CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
| : || TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCAT GGCT GCCC TGAAGT GGGGACGGAATAGAC T CACAT T AGG T T TAGT T T GT GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAGACAACGAATGTGAATCATGCTTGCAGGTT 
TGAGGGCACAGTG T T TGC T AATGAT GT GT T T T TAT AT TATACAT T T T CC CACCATAAAC T C TGT T 
TGC T TAT TCCACAT T AAT T TACT T T TC TC TATACCAAAT CAC CCAT GGAATAGT TAT TGAACACC 
T GC T T TGTGAGGC TCAAAGAATAAAGAGGAGGT AGGAT T T TT CAC T GAT TC TATAAGCCCAGCAT 
TACCTGATACCAAAAC CAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC TACAGGT CCATAT CC 
C T CAT TAACAC AGACACAAAAAT TC TAAATAAAAT TT TAACAAAT TAAAC T AAACAATATAT T TA 
AAGATGATATATAACTAC T CAG T GT GGT T TGTC CCACAAAT GCAGAG TT GGT T TAATAT T TAAAT 
ATCAACCAGTGTAAT T CAGCACAT TAATAAAGT AAAAAAGAAAAC CATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVL I LVLS FYELVS GQWQVTGPGKFVQALVGEDAVFSCSLFPE TSAEAMEVRFFRNQ FHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPL I S I VGYVDGGI QLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVE I S I 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRIASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWAS QGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLS PNNGYWVLRLTTEHLY FT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE los 



CCTTCACAGGACTCTTCATTGCTGGTTGGC AATGA TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT TC AT T ATGTG AGAT AT AATC AAAAGAAGAC CT AC AATT ACTAT AGC AC AT TGT C ATT T AC AAC 
TG AC AAAC TATATGCT GAGT T T GGCAGAGAGGCT TCT AAC AATT TT AC AGAAATGAGCCAGAGACT TG AAT 
C AAT GGT G AAAAAT GC ATT T T ATAAATCT CCATT AAGGGAAGAATT T GTC AAGTCT CAGGTT AT C AAGTT C 
AGTC AACAGAAGCATGGAGT GT TGGCTC AT AT GCTGT T GAT TTGTAGATT T CACT C TACT GAGG AT C C TGA 
AACT GT AGATAAAATTGT TCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGT AGGACCCCCTAAAGTAG 
j£* AT CC TC AC TC AGTT AAAAT T AAAAAAAT C AACAAGAC AGAAACAGAC AGCTATCT AAACC AT TGCT GC GGA 

III AC AC GAAGAAGTAAAACT CT AG GTCAGAGTCT CAGG AT CGT TGGTGGG ACAGAAGT AGAAGAGGGT GAATG 

S GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
I , § TT GT GAGT GC TGCT CACT GT TT TACAAC AT AT AAGAAC CC T GCCAG ATGGACTGCT TCCT T T GGAGTAACA 

SK ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
||| TG AC TAT GAT AT TTCT CT T GCAGAGCTT TCT AGCCCT GTT CCCT AC AC AAAT GC AGTACAT AGAGT TT GT C 

TCCC TG AT GC AT CCT ATG AGT TTC AACC AGGT GATGT GAT GT TT GT GACAGGATT T GGAGC ACT GAAAAAT 
3 GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGC^ATG7^ACC 
M TC AAGCT T AC AATGACGCCAT AACTCCT AGAATGTT AT GT GCTGGC T CCT TAGAAGGAAAAACAGATGCAT 

GCCAGGGT GACTCTGG AGGACC ACTGGT TAGT TC AGAT GC TAGAGAT ATCTGGT AC CT TGCT GGAAT AGT G 
AGCT GGGGAG ATGAAT GT GC GAAACCC AACAAGCCTGGTG TT T AT ACT AGAGTT AC GGCCT T GCGGGACT G 
GATT ACT T CAAAAACT GGT ATC TAAGAGACAAAAGCC T CATGGAAC AG AT AAC AT T TT TT T TTGTT T T T T G 
GGTGT GGAGG CCAT TT TT AGAG AT AC AGAATT GG AGAAGACTTGCAAAACAGCT AG AT TT GACT GATCTC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACT CT GTCATCT GTG AGCAAT AGTT GAAAC TT T AT GT AC AT AG AGAAAT AGAT AAT AC AAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GC AT AT AT ACAATT TG AAGC ACTCCT TT TCT T CAGT T CCTC AGCTC CTCT CAT TTC AGCAAAT AT C CATTT 
TC AAGGTGCAGAAC AAGG AGTGAAAGAAAAT AT AAGAAGAAAAAAATCCC CT AC AT TT T ATT GGCACAGAA 
AAGT AT T AGGT GTT TT T CTTAGTGGAAT ATT AGAAAT GAT C AT ATT CATT AT GAAAGGTCAAGC AAAGAC A 
GCAGAATACCAATCACTTCATCATTTAGGA&GTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AAT TAT AC AAAC T TCAT GC AATGT ACT TG TT CT AAGC AAAT T AAAGC AAAT AT TT AT TT AACATT G 
T T ACTGAGGATGT CAACATAT AAC AAT AAAAT AT AAAT C ACCC A 
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FIGURE 106 



MM YR P D WRARKRVC WE PWV I GL V I F I S L I VLAVC I G L T VH Y VRYNQKKT YN Y Y S T L S FT T DKL Y 
AEFGREASNNFTEMSQRLESMVKMAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TE VEEGE W PWQAS L QWDG SHRCGATL I NATWL VS AAHC FT T YKNP ARW T AS FGVT I KPS KMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DATTCNEPQAYNDAI TPRMLCAGS LEGKTDACQGDS GGPLVS S DARDI WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 {type II) 



APP_ID=1 0063534 



Page 256 



FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT TCT CAC T T T AACTGCAGAAACGAGGGTAGAGGAAGCAG T CAT T TTGACT T AC T TT CCT 
. GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
t;';' GATAT GGT CACT T T GAAAGCCAGGATGACAAAT TAT GGAT TACCTAGATATC GG T GGCT TACT CA 

Q T GC T T GGAAT TT T T T T CAGAGAGAGT T TAAGT GC T GTGGAGTAGTATAT T TCACT GACT GGT T GG 

O AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
fp CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
?l TT TGAGAGGAACCAAACAACT GCAGGT GCT GAGGT T TC TGGGAAT C TCCATT GGGGT GACACAAA 

fi TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
Uj ACAGAC CAAATGAT GT CC T T GAAGAAT GACAACT CTCAGCACC TG T CAT GTCCC T CAGTAGAACT 

Uj GT T GAAACCAAGCCTGT CAAGAAT CT T T GAACACACAT CCAT GGCAAACAGC T T T AATACACACT 

J* TT GAGAT GGAGGAGT TATAAAAAGAAAT GT CACAGAAGAAAACCACAAAC T T GT T T TAT T GGACT 

" T GT GAATT T T TGAGT ACAT ACTAT GTG T TT CAGAAATATGTAGAAATAAAAATG T T GCCATAAAA 

!L TAACACCTAAGCATATAC T AT TCTATGC TT TAAAATGAGGAT GGAAAAG T T TCATGT CATAAG T C 

y ACCACCTGGACTyVTAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
If! TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
p. TCCATGCAAACGAGTCACATATGGT GGGAC T G GAG CCAT AGTAAAGGT T GAT T T AC T TCTACCAA 

fti CTAGTATATAAAGTACTAAT TAAAT GCTAACATAGGAAGT TAGAAAATAC TAATAACT T T TAT TA 

lit CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
W AC C TAAAT G T GAT TTTTGCTGGT T AC T AAAAT AT TCT T AC CAC T T AAAAG AGC AAG C T AAC AC AT 

fll TG TC TTAAGC TGAT CAGGGAT T T T T TG TATATAAGTCT GTG T TAAATC TGTATAAT T CAGT CGAT 

T T CAG T TC TGATAAT GT TAAGAATAACCAT TAT GAAAAGGAAAAT T TGTCC TGTATAGCATCATT 
ATTTTTAGCCTTTCCTGTT7\ATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGTTATTTAAATACTTAACCACTAATTT TGAAAAT TACCAGT GTGATACATAGGAATCATTAT TC 
AGAAT G TAGT CT GGT CT T T AGGAAGTAT TAATAAGAAAAT T T GCACATAAC T TAGT T GAT T CAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA 
TAGTGT TC T T TT TCTCCAGAAAAATGC T TG TGAGAAT CAT TAAAACAT GT GACAAT T TAGAGAT T 
CTTTGTTT TAT T T CACT GAT T AAT AT AC TG T GGC AAAT T ACACAGAT TAT TAAAT T T T T T T ACAA 
GAGTATAGTATAT T TAT T TGAAAT GGGAAAAGT GCAT T T T AC TGTATT T T GT GTAT T T T GT T TAT 
T TC TCAGAATAT GGAAAGAAAAT TAAAATGT G T CAAT AAAT ATT T T C TAGAGAG TAA 
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FIGURE 108 

MAREDSWCLRCLLYALNLLFWLMSI^ 

VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVEIACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 



I'll 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT C T T T CTC GTGAGAGC C T AGAGGC C TT AAAAAAAAAAGT GCTT GAAAGAGAAG GG GAC AAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTG ACCACC AGTT GGAT GAGTCT CAAGGTCCTCCTCTATGT GAC AACC AT GT GAAT GGGGAG 
T GGT ACC ACT T C ACG G GC AT GGC GGGAGAT GC CAT GC C T ACC T TC T GCAT AC C AGAAAAC C ACT GT GG AACC C A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCT GCTCAGATACCAGC GAGT GCAC AT GC GC T C CAGGAACT GTGCT AGGCCCT GAC AG GC AG AC AT GCTT T G 
AT G AAAAT GAAT GT G AGC AAAAC AAC G GT GG C T GC AGT GAGAT CT GT GT GAACC TC AAAAAC T C CT AC C GCT GT 
GAGT GTGGGGTTGGC CGTGTGCTAAGAAGTGATGGCAAGACTT GT GAAGACGT TGAAGGATGCC ACAAT AACAA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCT C AAGAC AT GT GGT AC AGT GGT C GAT GT G GT GAAT GAC AAG AT T GT GGC C AGC AAC C T C GT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
AC C T GC GAGT T T C CAC GCC T GT AC AC CAT T T C T GAAG GAT AC GT T C C C AACC T T C G AAACT CCC C AC T GGAAAT 
CAT GAGC C GAAAT C AT GGG AT CT T CC C AT T C ACT C T G GAGAT C T T C AAG GAC AAT GAGT T T GAAGAGC C T T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGT C T AC AGG GC C AGAC GCT AAC AGG C GGC C C GAT C C GCAT C GAC T GGGAG GACTAGT T C GT AGC CAT AC CTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CC C AGAAAGAC ACT C ACCC C AT T T C CC TC AT T T C T T T CCT ACAC T T AAAT AC CT CGT GT AT GGT GCAAT C AG AC 
CACAAAATCAGAAGCT GGGTATAATATTTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACTGAAATTAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAAT TTGGAAGT GTATCAATAAAACAGTATATAATTTT 
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■5:? 

uj 



HI 



MPPFLLLTCLFITGTSVSPVMiDPCSAYISLNEPWE^ITDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACAS FNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLY FGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 



II amino acids 1-16 



N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



ft I N-myristoylation sites . 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 



ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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CI 

ru 



GAGAGAGGCAGC AGCT T GCT CAGCGG AC AAGG AT GCT GGGCGTGAGGGACCAAGGC C T GCCC T GC ACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGAT CCTGAC AGT GAT CAACCT CT GAAC AGCCTCGATGTC AAACCC CT GC GC AAACCCCGTATCCC C 
AT GG AG AC CT T C AG AAAGGT GGGGAT CC CC AT CATC AT AG CACT AC TGAGCCTGGC GAGT AT CATC ATT GT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGG.AACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTAC AGC AGAGCT GT GG AGATTGGCCC AGAC CAGGAT CT GGAT GT TGTT GAAAT C AC AGAAAACAGCC AG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AGC ATCCAGT AC GAC AAACAGCACGT CTGT GGAGGGAG CATCCT GGACCC CC AC T G GGTCCT CACGGC A 
GC CC ACT G CT T C AGGAAACAT ACCGATGTGT T CAACTGGAAGGT GC GGGC AGGCT C AG AC AAACT GGGCAG 
CT TC CC AT CCCTGGCTGT GGCCAAGATC ATCATCATT GAAT T CAAC CC C ATGT ACC C C AAAG ACAATG AC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATG AGGAGCTCAC TC CAGCCACCCCAC TCT GGATC AT TGGATGGGGC TT T ACG AAGC AGAAT GGAGGGAA 
GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCC AAAGTCAGACAC AGAGC AAGAGT CCCC TT GGGT AC AC CC CT CT GC CCAC AGCCT C AGCAT 
TT CTT GGAGC AGC AAAGGGCCT C AAT TC CTGT AAGAGACCCT CG CAGC CC AG AG GCGC CC AG AGGAAGT CA 
GC AGCCCT AGCTC GGCCAC ACT TGGT GC TCCC AGCATCCC AGGG AGAGAC AC AGCC CACT GAACAAGGTC T 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
C AGATCAC TGT GGGCT GGAG AGGAGAAG GAAAGGGTCT GCGC CAGCCC T GTC CGTCTT C ACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
AC TG TT GT CAT TGTTAT T AC AGCT AT GG CCAC TAT TAT T AAAGAGC T GTGTAAC ATCT CT GGCAAAAAAAA 
AAAA 
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MLQDPDS DQPLNS LDVKPLRKPR I PMET FRKVG I P 1 1 1 ALLS LAS 1 1 1 WVL I KVI LDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVE I GPDQDLDWE I TENSQELRMRNS SGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS I LDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GS FPS LAVAK 1 1 1 1 E FNPMYPKDND I ALMKLQFPLT FS GT VRP I CLP FFDEE L T PAT PLW 1 1 GWG 
FTKQNGGKMS DI LLQASVQVI DS TRCNADDAYQGEVTEKMMCAGI PEGGVDTCQGDS GGPLMYQS 
?** DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

ff§ Transmembrane domain: 

|y amino acids 32-53 (typell) 

|f! 
hi 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACT ATGC TGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACC T GACCT T CAC T CT GGAACGAGAACAGAGGT T T CTACCCACACCGT CCCC T CGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
1^ GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
P CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
ll! AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
fi| GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ti! ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 

r. 

Ill CAGAAGGGGT CT GCTTAGACCACC TGGT T TAT GT GACAGGACT TGCAT T C TCC T GGAACAT GAGG 

4*- GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 

J s% GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

III CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 

fl CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 

f|| TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 

*** CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 

fit 

GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACAC GGAAAT GCCT CCAGT AAGCACAGGC T GCAAAAT CCCCAGGCAAAGGACT G T GT GG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCC7\AGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TT T GGGAGGC CGAGGC GGGTAGATCACC T GAGGT CAGGAGT T CAAGACCAGCC T GGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGC TGAGACAGGAGAAT TACT T GAACC T GGGAGGTGAAGGAGGC T GAGACA 
GGAG AAT CAC T T C AG C C T G AGC AAC AC AG C G AGAC T C T G T C T CAG AAAAAAT AAAAAAAG AAT T A 
TGGTTATTTGTAA 
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FIGURE lid 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 



O 

a 
m 
W 
ill 
W 



o 

y! 

a 
rii 
o 
rii 
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FIGURE lis 

CAGCAGTGGT C TC T CAGTCCT CTCAAAGCAAGGAAAGAGTAC T GT G TGC TGAGAGACCATGGCAA 
AGAATCC TCCAGAGAAT TGT GAAGAC TGTCACAT TC TAAAT GCAGAAGC T TT TAAAT CCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
G T T T T G GGGGAGCAAG C AC T T C T G GC C GGAGG T AC C CAAAAAAGC C TAT GAC AT GG AGCAC AC T T 
TC TACAGCAATGGAGAGAAGAAGAAGAT TTACATGGAAATTGATCC TGTGACCAGAACTGAAATA 
TT CAGAAGCGGAAAT GGCAC T GAT GAAACAT T GGAAGTGCAC GAC T T TAAAAAC GGATACACT GG 
3 L CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
f ^ C T GAAC C AG AAGAG G AAAT AG AT GAGAAT GAAGAAAT TACCACAAC T TTCTTTGAACAGTCAGTG 

W ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
$\ GAT TTGTGATAACGTGACCATGTAT TGGATCAATCCCACTCTAATATCAGTT TC TGAGTTACAAG 

ACT T TGAGGAGGAGGGAGAAGATC T TCAC T T T CCT GCCAACGAAAAAAAAGGGATT GAACAAAAT 

is ?, 

| s | GAACAGTGGGTGGT CCCTCAAGTGAAAGTAGAGAAGACCCGT CACGCCAGACAAGCAAGTGAGGA 

«P AGAACTTCCAATAAATGACTATAC TGAAAATGGAATAGAATT TGATCCCATGC T GGATGAGAGAG 

* GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

Pi 

GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 

111 

Ji; TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 

jfjjf TGCT GGCAACATATAATAAATGCATGCTATT CAAT GAAT TTC TGCC TATGAGGCAT CT GGCCCC T 

Q GGTAGC CAGCT CT CCAGAAT TACT T GTAGGTAAT T CCTCTCT TCAT GT T C TAATAAACTT C T ACA 

ft! TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHI LNAEAFKSKKI CKSLKICGLVFGI LALTL I VLFWGSKHFWPE VPKKAYDffi 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



m 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
T ATTTC AC CAT CCT GGG ACT TCCAGC CATGCT GCAGGC AGTGCG AG CCCT GATG AT CG TAGGC AT C GTCCT 
GGGTGCCATT GGCC TCCT GGT AT CC ATCTTT GCC CTGAAAT GC ATC CGC AT T GGC AGC AT GG AGGACT CT G 
CC AAAGCC AACATGAC ACTG AC CTC C GGGAT C AT GTT C AT T GT CT C AGGT CT TT GT GC AAT T GCT G GAGT G 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
s , 1 GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
fjl TGGGTCC AACACCAAAAAC AAG AAGAT AT ACGAT GGAGGT GCCC GC AC AGAGGACG AGGT AC AAT CTT AT C 

bf CT TC C AAGCACGAC TAT GTG TAATGCTCT AAG ACCT CT CAGC AC GG GC GGAAGAAACT CCCGGAGAGCTC A 

*K CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
5 CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTG AGT T AT TT AT GAATT AGAGGCT AT AGCT CACAT T TT CAAT CC TCT AT T TCTT T T TT T AAAT AT AAC T 

If! 

TT CT ACT C TGATGAGAGAAT GT GGTT TT AAT CTCTCT C TC AC AT TT T GAT GATT T AGACAGACT CC CCCT C 
!;! TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
GAGT AG AC CCAAAG AT GTTATT T TCT GCTGT T TGAAT T TT GTCT CC CC ACCC CC AACT TGGC T AGT AAT AA 
AC ACTT AC TGAAGAAG AAGC AATAAGAGAAAG AT ATTT GT AATCTC TC CAGC C CAT GATCT C GGT T TTCT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT TGT CC C T AGATGAATG AGAAAATT AT T T TT TT T AAT T TAAGT CCT AAAT ATAGTTAA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAAT GAAAAAAT AAT T GCT TT GACATT GTC T AT ATGGT AC T TT GT AAAGT C AT G C TT AAGT AC AAAT TC C 
AT GAAAAGCTCACACC TGTAAT CCT AGC ACTT TGGGAGGC TGAGGAGGAAGGAT CACT TG AGCC CAGAAG T 
TC GAGACT AGCCT G GG CAACAT GGAGAAGCCCTGT CTC T ACAAAAT AC AG AG AG AAAAAATC AG CC AGT CA 
TGGT GGCAT ACACC TGT AGT CC CAGC AT T CCGGGAGGC T G AGGT GGGAGG AT CACT TGAGCC C AGGGAGGT 
T GGG GCTGCAGT GAGCCATGAT C ACACC ACT GCACT CC AG CC AGGT GACAT AGCGAGATCCT GT CT AAAAA 
AATAAAAAAT AAAT AAT GGAAC AC AGC AAGTCCT AGGAAGT AGGT T AAAACT AAT T CT TT AA 
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FIGURE 118 



MS T TTCQWAFLLS I LGLAGC I AATGMDMWS TQDLYDNPVTS VFQYEGLWRS CVRQSS G FTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLV^ 

CAIAGVS VFANMLVTNFWMS TANMYTGMGGMV QTVQTRYT FGAAL FVGWVAGGLTL I GGVMMC IA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



111 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGAT GCAGTGCAAAAT CTAT GAT T CCCT GCTGGC T CTT T CTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
s , AC C AGGT GC AC GG GG G ACAAT GAGAAGG T GAAGGC TC AC AT T CT G CT GAC G G C T G GAAT CAT CTT CAT 

CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
Q ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
HI CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 
T CTACTCCAGAAGTCAGTATGTG TAG TT GTGTATGTT TT TT TAACTT TACTATAAAGCCAT GCAAATG 
ACAA?^AATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
a '" AATCT TAAT TACAGGAACT GT GCAT CAGCTAT T TAT GAT TCTATAAG CTAT T T CAGCAGAATGAGATA 

Q TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
Iff CTCT T TTTATCAT TTACTTCAAAATGACATT GCTAAAGACT GCATTAT TTTACTACT GTAATTT CTCC 

Q ACGACATAGCAT TATGTACATAGAT GAGTGTAACATT TATATCTCACATAGAGACATGCT TAT AT GGT 

j.^ TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 

H ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 

III 

T TTATAATGAAGATTAAAATGAAGGCTT TAATCAGCATTGTAAAGGAAATTGAATGGCTTTCTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 
ATAT TTTAATAAGTAT TTGAGTACAGACTTT GAGGTTTCAT CAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACAT TTAT AT TAATAAAT TGTACAT TTT T CTAAT T 
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FIGURE 12Q 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCTOQANIRMQCK 
I YDSLLALSPDLQAARGLMCAASVM^ I FI ITG 

MWL I PVSWVANAI IRDFYNS I VNVAQKRELGEALYLGWTTALVLI VGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains : 

amino acids 82-101, 118-145, 164-188 



s 

Q 

fr : 



APP ID=1 0063534 



Page 270 of 3 18 



FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCC ATGC GACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAAT GT CTGAGGGAAAGC TT T GAGGAGT CC TGGACACCCAACTAC 
AAGCAGTGT T CAT GGAGT TCATTGAATTATGGCATAGAT CT TGGGAAAAT TGCGGAGTGTACAT T 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT TGAAGCTATAAT T TAT T T GGAC CAAGGAAGCCC T GAAAT GAAT TCAACAAT T AATAT T CATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T TGGCAC TT G T TCAGAT TACC CAAAAGGAGAT GCT T C TACT GGATGGAAT TCAGT T T CT CGCATC 
AT TAT T GAAGAACTACCAAA ATAAA TGCT T TAAT T T T CATT T GC TACC T CT T T T TT TAT TATGCC 
T T GGAATGG T TCAC T TAAATGACAT T T TAAAT AAG TT TATGT ATACAT C TGAAT GAAAAGCAAAG 
C TAAATATGT T TACAGACCAAAGTG TGAT T T CACACT GT T T T TAAAT C TAGCAT TAT T CAT TT TG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCT T TGTACAAT TT GTAAAT GT TAAGAAT T T T T TT TATAT CT GT 
TAAAT AAAAAT TAT T T CCAACA 
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FIGURE 122 

MRPQGPAAS PQRLRGLLLLLLLQLPAPS SASE I PKGKQKAQLRQREWDL YNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide : 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCC TGGAAGAATACATCATG T T T T TCGATAAGAAGAAAT TGTAGGATCCAG TTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCGATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATGTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GC C C C T AAGC C AAAG C AAAAG AC C T AAGGAC G AC C T T T G AAC AAT AC AAAG GAT GG G T T T C AAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
\»V CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
r*l ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
f9% ATAACAGCC T T CAAAAACT TAAGTATAATCAAT TTAAAGGGC T CAACCAGCTCAC C T GGCTATAC 

J! CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA 
y*] GC TGAT TCT TAG T T CCAATAGAATC TCC TAT T T TC TT AACAATACC TT CAGACC T GT GACAAAT T 

ly TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
10 CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
1*1 AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
*l TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
"r" CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGT CCTGGACCTGGAGCT CC T TACAAAGGCT TGAT T TATCAGGCAAT GAGA 
p TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
$f% TCCAACAAGC T CACAT T TAT T GGTCAAGAGAT T T T GGAT TCT T GGATAT CCCTCAAT GACATCAG 

Xjj TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
»J TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
fl| ATCGAT GCAGT GAAGAAC TACAGCATCT GTGGCAAAAGT ACTACAGAGAGGT T TGAT CT GGCCAG 

Q GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
jaj TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
. § k ' TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 

CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGT CCC TAAAGCAAAT GAC T CC CAGCACCCAGGAAT TT TATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGAT GCT GCT GAAT GGGAC GGGACCCTGCACCTA 
TAACAAATCGGGC T CCAGGGAGT GTGAGG TATGAAC CAT TGT GATAAAAAGAGC TC T TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT TT TCCT C T CATAC AT AAT CAAC CCAT T GAAAT T TAAATACCACAAT CAAT GTGAAGCT T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
G T TT TAAGATAAAACT T C T T T CATAGG TAAAAAAAAAAA 
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FIGURE 124 



m 



W 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNS LQKLKYNQ FKGLNQLTWL YLDHNH I SNI DENAFNGI RRLKEL I LS SNRI S YFLNNT FR 
PVTNLRNLDLS YNQLHSLGSEQFRGLRKLLSLHLRSNSLRT I PVRI FQDCRNLELLDLGYNRIRS 
LARWFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS I CGKS TTERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



« Transmembrane domain: 

amino acids 420-4 42 

Jes N-glycosylation sites. 

Ill 

p| amino acids 126-129, 357-360, 496-499, 504-507 

III 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 4 65-4 68 



Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites . 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCGCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
*f£ AT GAGCAT TATCCCCT T T GGAAATCAGT CAT T GGAGGGATGATGGC TGGT GT TAT T GGC CAGT T T 

y TTAGCCAATCCAAC TGACCTAGTGAAGGTT CAGATGCAAAT GGAAGGAAAAAGGAAACT GGAAGG 

JjJ AAAACCATT GCGAT TT CGT GG TGTACAT CAT GCAT T T GCAAAAAT C T TAGC TGAAGGAGGAATAC 

y GAGGGC T TT GGGCAGGCT GGG TACCCAATATACAAAGAGCAGCAC T GGTGAATAT GGGAGAT T TA 

4f| ACCAC T TAT GATACAG T GAAACAC TAC T TGG TAT T GAATACACCAC T TGAGGACAATAT CATGAC 

W TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAAT AAT GAAT CAAC CACGAGAT AAACAAGGAAGGGGAC T T T TGTATAAAT CAT CG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 



m 



Ill 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMTOTALGIIEEEGFLKLWQGVTPAIYRHVWSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANPT DLVKVQMQME GKRKLE GKP LR FRGVHHAFAK I LAEGG I RGLWAGWVP 
NIQRAALVIMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 



-, :. :; 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACAT CCT C T CCAT GT T C T CAGC TAT C CG T T C T CAGCACAGC GGT G T AGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
HI GGACAT GT T CAAT GT GAACGT GC T GGCC CTCAGCATCT GCACACGGGAAGCC TACCAGT C CAT GA 

S AGGAGC GGAATG T GGAC GATGGGCACAT CAT TAACATCAATAGCAT G TC TGGC CACCGAGT GT TA 

|f| CCCC TG T CT G TGACCCAC T T CTATAGTGCCACCAAGTATGC C G T CAC TGCGC T GACAGAGGGAC T 

III GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT T C GCCT T CAAACTCCACGACAAGGACCCT GAGAAGGCAGC TGCCACC TAT GAGCAA 

UJ 

w ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGAT T GGAGACAT CCAGAT GAGGCCCACGGAGCAGG T GAC C TAG TGAC T GTGGGAGCTCC 
|!| TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
til CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
5j TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
ll\ TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
p| TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCT GGCCCAGT GGAT T TCATGGT GAT CAT TAAAAAAGAAAAAT CGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLPCWGCARTVGNIEELAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLL^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 





APP ID-10063534 



Page 278 of 318 



FIGURE 12Q 



AACTTCTACATCGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATTTTTAACTGATAGTTGTACATATTTGGGGGTACATGTGATATTTGGATACATGTATACAA 
H TATATAATGATCAAATCAGGGTAAC TGGGATATCCATCACATCAAACATT TAT TTTTTATTCTTT 

O TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
*M CTCTGCCTGCCAGGTTCT^AGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
J?f GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
JZ GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
* ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
M ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
jf.? AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
JTs AGTAAGAAAATGCAATAT TT GT C TT TC T GTGC CT GGCT TAT T T CAC T TAACATAAT GAC T T CC TG 

PJ T T CCAT CCAT GT T GCT GCAAATGACAGGAT T T CGT TCT TAAT T TCAAT TAAAAT AAC CACACATG 

III GCAAAAA 



hi 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGE S PMP FKRVFCQDGNVRS FCVCAVH FS S HQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 8 6-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lftl 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC 
AT CT T C CT CATCGGGACT AAAATTGGGC TGT T CC TTCAAGT AGC ACCTCT AT CAGT TATGGCT AAAT C CT G 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC CAGAGG AT GC TAC AAC TCTCT ACCT T CAGAACAACC AAAT AAAT AATGCT GGGATTCC TTC AGAT 
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 
CCC AAAGT AT GT AAAAGAGT T AC ATT TGC AAG AAAATAACAT AAGG ACTATC ACT TAT GATT C ACT TT C AA 
AAAT TC CC TAT CTGGAAGAATT AC AT TT AGAT GACAACTCT GTCTCT GC AGT T AGC AT AG AAGAGGGAGCA 
H TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
^ GCCC AGG ACT AT AG AAGAACT AC GCT TGGATGAT AATC GCAT AT CC ACT ATT TC AT CACC AT CT CT T C AAG 

::f GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 

rn 

TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

III 

»5?= TCC AGGC AC AAACC TG AGGAAGCTTT AT CTTC AAGAT AACCACATC AATC GGGTGC CCCC AAAT GCT T T T T 



in 



CT T ATCT AAG GCAGCT C TAT CG ACTGGATATGTCC AAT AATAACCT AAGT AATTT ACCTC AGGGT ATC T T T 
GAT GATT T GG ACAATAT AACAC AACT GATTCT TCGC AACAAT CCCT GGT ATT GCGGGT GC AAGATG AAAT G 
GGT ACGT GAC TGGT TAC AAT CACT ACCT GT GAAGGTC AAC GTGCGT GGGCTC ATGT GCCAAG CCCC AGAAA 
AGGT TCGT GG GATGGCT ATT AAGG AT CT CAAT GC AG AACT GTTT GAT TGT AAGGAC AGTGGGAT TGT AAGC 
ACC ATTC AGATAAC C ACTGC AAT ACCCAAC AC AGT GT ATC CTGCCC AAGG AC AGT GGCC AGC TCC AGT GAC 
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
CAAT TAC AAT TACT GT GAAGTCT GTCAC CT CT GAT ACC AT TC AT AT CTCT TGGAAACTTGCT CT AC CT AT G 
ACT GCT T T GAGACT CAGCTGGCT TAAAC TGGGCC AT AGCC CGGC AT T TGG AT CT AT AACAGAAACAAT TGT 
AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGT ACAACCCT AC AACCACCCT CAATCGAG AGC AAG AGAAAG AAC CTT AC AAAAACCC CAAT T T ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AAT GG ATC G C TCTT C TCAAGGAAC TGT GC ATATAGC AAAGGGAGGAG AAGAAAG GATG AC TAT 
GC AG AAGC TGGCACTAAGAAGGAC AACT CT AT CC TGGAAATC AGGG AAACTT CTT T TCAG AT GT T ACC AAT 
AAGC AAT G AACCC ATC T CGAAGGAGGAGTTT GT AAT AC AC ACC ATAT TTCCT CCT AAT GG AATGAATCT GT 
AC AAAAAC AAT C AC AGTGAAAGC AGT AGT AAC CGAAGC TACAGAGAC AGT GGT AT T CC AG ACTC AG AT C AC 
TC AC ACT C ATGATGCT GAAGGAC T CACAGCAGAC TT GT GT T TTGGGT TTT TT AAAC CT AAGGGAGGTG AT G 
GT 
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FIGURE 132 



MI SAAWS I FL IGTKI GLFLQVAPLSVMAKSCPSVCRCDAGFI YCNDRFLTS I PTGI PEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLS'RNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVP PNAFS YLRQL YRLDMSNNNL SNLPQG I FDDLDNI TQL I LRNNPW YC GCKMKWVRDWLQS LPV 
KVNVRGLMCQAPEKVRGMAI KDLNAEL FDCKDS G I VS T I QI T TAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQ T TGSPSRKT I T I TVKS VTSDT IH I S WKLALPMT ALRLS WLKLGHS PAFGS ITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
H PYKNPNLPLAAI I GGAVALVT IALLAL VCW YVHRNGS L FSRNCAYS KGRRRKDDYAEAGTKKDNS 

|M ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

i s ? 

r ? * Important features of the protein: 

ill 

g Signal peptide: 

$ amino acids 1-28 

M Transmembrane domain: 

amino acids 531-552 

;f""s 

N-glycosylation sites. 

f*J amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 

Ill Tyrosine kinase phosphorylation site. 

amino acids 515-522 

N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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figure iaa 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACGCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGAC TCAT T C TCAGAGGAGAGAAGTGACGT GT GCC TGGTGCAGCT GC TGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
AGAGCT GAGGCCAT CGGAT ACGCC TACCC TACCCGGGACATC T TCATGGAAAACAT CATGT TCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
TATCAGCAGCATT TT TCGAGGAGAGTGAAGAGGCGAGAAAAACAATT TCCAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGT T CCAT GC CAGCT GCC TGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGC CGGGAGGGACCCAGGT GT GAACGGATGAAT7VAAG T T CAAC TGCAAC T GAAAAAAAAAAA 
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FIGURE 134 

MS ARGRWEGGGRRACRGS LGLARAQGAERVT S SEQRPAMAS LGLLLLLLLTALPPLWS SS LPGLD 
TAE S KM I ADL I LS ALERATVFLEQRL PE INLDGMVGVRVLEEQLKS VREKWAQEPLLQPLS LRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCAM^MDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACC T TCTTACT T TATTAAAAGGGAAT CGTGGGGCT GGACAGAC T T TC TAAT GAACCCAAT GG T 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACAT GAGACGGGAAATGGAGCAGTCAATGAATAT GC TGAAT TCCAACCAT GAGT T GCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAATAT T GATGTCAC TGAAT TAAT TACAGT GT CCTATAGAAAAT GCCAT TAATAAAT TAT 
AT GAAC T AC TAT AC AT TAT G TAT AT TAAT T AAAAC AT C T TAAT C C AGAAAT C AAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE lft6 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLIFVLLPKVVNTSDPDMRREME 
QSMMMLNSNHE LPDVSE FMTRL FS S KS S GKS SSGSSKTGKS GAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
. , CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
f"| AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
D TGCAAGCCAAAGAT GGAAACAT T GACATCAGAAT CT TAAGGAGGAC TGAG TCT T TGCAAGACACA 

ff| AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
7*J AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
H J CCATCAAGAAGGACCTC CGGC TC T C TCATGCCCACATGACAT GCCAT TG T GGGGAGGAAGCAAT G 

J;: AAGAAATACAGCCAGAT T CT GAGTCAC T T TGAAAAGC T GGAACC T CAGGCAGCAGT T GTGAAGGC 

C TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
M CTGC TAAGAATAT T C GAGGT CAAGAGC TCCAGTC T T CAAT AC C TGCAGAGGAGGCAT GACCCCAA 

LP 

accaccatctctttactgtactagtcttgtgctggtcacagtgtatcttatttatgcattacttg 
fj» cttccttgcatgattgtctttatgcatccccaatcttaattgagaccatacttgtataagatttt 
Ft tgtaatatctttctgctattggatatatttattagttaatatatttatttattttttgctattta 
141 atgtatttatttttttacttggacatgaaactttaaaaaaattcac^vgattatatttataacctg 
actagagcaggtgatgtatttttatacagtaa?u\aaaaaaaaccttgtaaattctagaagagtgg 
ctaggggggttattcatttgtattcaactaaggacatatttactcatgctgatgctctgtgagat 
atttgaaattgaaccaatgactacttaggatgggttgtggaataagttttgatgtggaattgcac 
atctaccttacaattactgaccatccccagtagactccccagtcccataattgtgtatcttccag 
ccaggaatcctacacggccagcatgtatttctacaaataaagttttctttgcataccaaaaaaaa 
aaaaaaaaaaa 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEI RGS VQAKDGNI DIRI LRRTE S LQDTKPANRCCLLRHLLRL YLDRVFKN YQTPDHYT LRK I S 

SIANSFLTIKKDLRLSHAHMTCHCGEEAMK^^ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTGGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCAC C TCT C T TCAGTAAAG T TGT TAT TGT T C TGAT AGAT GCC T TGAGAGATGAT TTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACGTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCT GGAAAAAGAATAGT CT TT TATG GAGATGAAAC CT GGGTTAAAT TA 
T TC CCAAAGCATT T T G T GGAATAT GAT GGAAC AACC TCAT T T T TCGTGTCAGAT T ACACAGAGGT 
M GGATAATAATG TCACGAGGCAT T T GGATAAAGTAT T AAAAAGAGGAGAT T GGGACAT AT TAAT CC 

Q TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
f ; i AAGC TGAGC GAGAT GGACAGCGTGCTGATGAAGAT C CACACC TCAC TGCAG T CGAAGGAGAGAGA 

JjJ GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
f j GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
W CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
Ifl ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
|jj . CAATGAGAGAGCAG T T GAGAT T T T TACAT T TGAATACAG T GCAGC T TAG TAAAC T GT T GCAAGAG 

"S5 AAT G T G C CG T CAT AT GAAAAAGAT CCTGGGTT TGAG GAG T T T AAAAT G T C AGAAAGAT T GCAT GG 

"J* GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
* TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
Q TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Iff CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
p| CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
IZ GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
III AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
p TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
f|| CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC TCAGATCCACAGAGC C CAGGATCAAGGGAC CCAC TGCAGT GGCAGCAGGAC T GT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACAC T CACAGCT T TGTCATCAGGGCACAGGCT T CCTCGGAGCCAGGATGAT C T G T G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAGTATGTAG T TAC CAAAAGAATAAAC GGCAATAAT. TGAGAAAAAAAA 
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FIGURE 14Q 



m 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKAmTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWYKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide : 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 14.1 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACT^ACT 
TCCC TATAGAAAAC AAC T GCC AGCACC T TAAGACC AC T CACACC TTCAGAGTGAAGAAC T TAAAC 
CCGAAGAAAT TCAGCATTCATGACCAGGATCACAAAGTACTGGTCC TGGACTC TGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GTGACAAGGAT AAAGGACAAAGTCATC CATCCC T TCAGCT GAAGAAGGAGAAAC TGAT GAAGCT 
\l\ GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
F| ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
CP GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCATTTCAACCAGTTTGCAA 

: _ s 

H:f AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
! :? J ATT TGAACTAAT TGTATAAAAACACCAAACC T GCTCACT 

%y 
*£« 

If! 
til 
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FIGURE 142 



MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 



N-myristoylation site. 

amino acids 50-55, 87-92 

Ul Interleukin-1 

*l amino acids 37-182 



in 



ry 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGAT GGCAGATGAG TGACT CCACATC CAGAGCTGCCT CCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAG T GGAGT T TGCTGTCCACACATT CAACCAACAGAGCAAGGACTACTAT GCC T ACAGACT GGGG 
CACATC T TGAATT CC T GGAAGGAGCAGGTGGAGT CCAAGACTGTAT TCTCAATGGAGC TACTGC T 
GGGGAGAACTAGGTGTGGGAAAT TTGAAGACGACAT TGACAACTGCCAT TTCCAAGAAAGCACAG 
AGCTGAACAATACT TTCACCT GCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGT T CAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATC TTGAGCCTAATCAT GTAGTGTAGATCATTAAACATCAGCATT TTAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



f 

111 



ru 



N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site, 
amino acids 9-15 
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FIGURE las 

CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
j£ CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
S TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
01 AAGC TTGCCAGGAGGCT GGCAG TACAGAGCGCAGCAGCGAGCAAAT CCT GGCAAGT GACCCAGC T 

W CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 

^ 1 TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

ill 

f» CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 

m.z 

X GAGAAGAACACAT CAGGCAC T GCGCCAC C TGC TT CACAGT AC T TCCCAACAAC T CT TAGAGGT AG 

Q GT GTAT TCCCGTTT TACAGATAAGGAAAC TGAGGCC CAGAGAGC TGAAGTAC TGCACC CAGCATC 

III ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

W ' GT CCAAT TCCAGAGC TGT CTGGT GATCAC T T TAT GT CT CACAGGGACCCACATCCAAACAT GTAT 

III 

CT CTAAT GAAATT G T GAAAGC TCCATGT T TAGAAAT AAAT GAAAACACC T GA 

m 

3 VST 
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FIGURE 146 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LD IDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGC INATQAANQGE FQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACT T T CAGAT T CAGGGGGTACAT GT GAAGGT T T GT T T TAT GAG TAT AT TGCATGATGC T GAGG 
TTTGGGGT 

u 

w 

C3 

m 



•w>S 

six is. 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myr i s toyl a ti on s i te . 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCT ACATAATAGAGAATT AAGT GCAGAAAGAC CT T TGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
T! T GTT GATAACTT GAAC CTGCTAAAGGCAATAACAGAAAAGGAAAAAAT TGAGAAAGAAAGACAAT CTA 

;7""T> 

III T AAGAAGCT CC C CACT TGATAATAAGTTGAAT GTGGAAGAT GT TGAT T CAACCAAGAAT CGAAAACTG 

%J 

H\ ATCGATGATTAT GACT CTACTAAGAGTGGAT T GGATCATAAAT TTCAAGATGATCCAGATGGTCT TCA 

||| TCAAC TAGACGGGACTCCT TTAACCGCTGAAGACATT GTCCAT AAAAT CGCT GCCAGGATT TAT GAAG 

|fl AAAAT GACAGAGCCGT GTT TGACAAGATT GT T TCTAAACTACT TAAT C TCGGCCT T ATCACAGAAAGC 

*y CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGT TT TACAAAAATTAATCT CAAAGGAAGCCAACAA 
T TAT GAGGAGGAT CCCAATAAGCCCACAAGCT GGACT GAGAAT CAGGCTGGAAAAATACCAGAGAAA.G 

I« T GACT CCAATG GCAGCAAT TCAAGATGGTCT TGCTAAGGGAGAAAAC GATGAAACAGTATCTAACACA 

tj 

gfl T TAAC CTTGACAAAT GGCT TGGAAAGGAGAACTAAAACCTACAGTGAAGACAACT TTGAGGAACTCCA 

%?§ 

H ATAT TTCCCAAAT TT CTAT GCGCTACTGAAAAGTATT GAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 



5 ! 2 = 

HI 



HI CACT GATTACTAT CAT GAAAACACT GAT TGAC TTT GT GAAGATGATGGTGAAATATGGAACAATATC T 

l»? C C AGAAGAAGG T G T T T C CT AC C T T GAAAACT T G GAT G AAAT GAT T GC T C T T C AGAC C AAAAAC AAGC T 

AGAAAAAAATGC TACT GACAATATAAGCAAGCT TT TCCCAGCACCAT CAGAGAAGAGTCAT GAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATAT GGAAGCT T GAAGGAT TCCACAAAA 
GATGATAACTC CAAC CCAGGAGGAAAGACAGATGAAC CCAAAGGAAAAACAGAAGCCTATT TGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
GAAAGATGAGAGACTT CAT CAATAAACAAGCT GAT GCTTATGT GGAGAAAGGCATCC T T GACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTT ATAGCAGCCT GTAAAAAT GGCAAAAGAT CCAGGAGT CT T TCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTAT TAGAAAGT GCT GAATT TACAGTAGTTAACC TT TTACAAGT GGTT AAAACATAGCT T TCT TCCC 
GTAAAAAC T ATC T GAAAGTAAAGTT GTAT GTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE iso 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMdVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 



It! N-glycosylation sites: 

W amino acids 68-71, 346-349, 350-353 



Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE lsi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATG GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCT TCTAGCT GGAGGGCT GCATGCAGGGAAGGTCAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAG TGCC T G T CATGT GGGGT GGGGCAGGAGCCGACT C TAACAC TAGAGCCAGT GAACAT CATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TG T CAGAC TCACCCAGC T TCCCGAGAATGGT GGCT GGAAT GCCCCCATCACAGACT T CTAC T TCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
H TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
Q CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
fi\ CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
:S TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
y ; GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
III GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
If! TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
J I GAAATGGCT C GAGCT CAGAAGATAAAAGATAAGT AGGGTAT GC TGAT CC TC TT T T AAAAACCCAA 

GATACAATCAAAAT CCCAGAT GCTGGT C T C TATT CCCAT GAAAAAGTGC TCAT GACATATT GAGA 
"fr* AGACCTACT TACAAAGT GGCATATAT T GCAAT TTAT T T T AAT TAAAAGATACC TAT T TATATAT T 

* TC T T TATAGAAAAAAGT C TGGAAGAGT T TAC T TCAAT T GTAGCAATGTCAGGGTGGT GGCAGTAT 

Q AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
i n AT T CC T T GT AT AAAAAT AAGAAAAGAAAT T AAT CT T G AGG T AAGC AGAGCAGAC AT CAT C T C T GA 

%l TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
W TTGTAG TAGTGAT CAGGAAACAGAT CT CAGCAAAGCCAC TGAGGAGGAGGC TGTGCT GAGT TT G T 

III GTGGC TGGAAT CT CTGGG TAAGGAACTTAAAGAACAAAAATCATC TGGTAAT T CT T T CCTAGAAG 

Q GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
f|| ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTC TCTGCAGATGTAGT TAGT TAAGACAAGG T CAT GC TGGATGAAGGTAGACCTAAAT TCAAT AT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGATCGGAGTT TTGCAGCCACAAGCTAAGAAACACCAAGGATT GTGGCAACC 
ATCAGAAGCT T GGAAGAGGCAAAGAAGAATT CT T CCC TAGAGGC T T TAGAGGGATAACGGCTCTG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT T GGTTACAGCAGC TCTAGGAAAC T AATACAGC TGC TAAAAT GATCCCT G T 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACT TCCAAGAT TAGGTTATAAAAGACAC TGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCC7\ATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC TAGC TGCCAT GC TATGAGCAGGCC TAT AAAGAGACT TAC G T GGTAAAAAATGAAGT C TCC T 
GCCCACAGCCACAT TAGT GAACCTAGAAGCAGAGAC T C T GT GAGATAAT C GAT G TT TGTT GT TT T 
AAG T TGC TC AGT T T T GGT C T AAC T T G T TAT GCAGC AAT AGAT AAAT AAT AT GCAGAG AAAGAG 



APP ID=10063534 



Page 301 of 3 18 



FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLIAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCC T TATATGCAGGAGGT GGT GCCCT TCC TGGCCAGGCT CAGCAACAGGC TAAGCACAT GT CA 
TAT TGAAGGT GATGACC TGCATAT CCAGAGGAAT GTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGTGGAGAGAT CAAAGCAAT TGGAGAACT GGAT T TGC T GT T TATGT C T CTGAGAAAT 
GCC TGCAT T TGACCAGAGCAAAGC TGAAAAATGAATAACTAACCCCC TT TCCC TGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGATTT TTT T TAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC T T TCTAAGCATAGATAT T TAT T GAT AACAT TTCATTGTAACTGGTG T TC 
a T ATACACAGAAAAC AAT T TAT T T T T TAAATAAT T GT C T T TT TCCATAAAAAAGAT TACT T TCCAT 

0 TCC TT TAGGGGAAAAAACCCC TAAATAGC T TCAT G T T T CCATAATCAGTACT T TATATT TATAAA 

yf T G TAT T TAT TAT TAT T AT AAGAC T GCAT T T TAT T TAT AT CAT T T TAT T AAT AT G GAT T TAT T TAT 

JJ* AGAAACAT CAT T CGATAT T GC TAC T TGAGTGTAAGGC TAATAT TGATAT T TAT GACAATAAT TAT 

AG AGC TAT AACAT G T T TAT T T GAC C T C AAT AAAC AC T T G GAT AT C C C 



Jr. 

y 5 



ill 
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FIGURE 154 

MAALQKSVS S FLMGTLATS CLLLLALLVQGGAAAP I SSHCRLDKSNFQQPYI TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCH IEGDDLH I QRNVQKLKDT VKKLGE S GE I KAI GE LDLL FMS LRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT TAGGTGAGGACAGT TCTCTCATTAGCCTT T TCCTACAGGTGGT TGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCT GCT CTACCACAACCAGACTGTC TTC TACAGGCGGCCAT GC CATGGCGAGAAGGGCAC CCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
= ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
O CAGGATCCCGGGACAGGATGGGGGGC T T T GGGGAAAACCT GCAC T T C TGCACAT T TT GAAAAGAG 

iff 

Zl CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
fk GCCCAGCACAGGCAC T T T CT AGATAT T T C CCC C T TGCT GGAGAAGAAAGAGC C CCTGGTTT TAT T 

f|| TGT T TGT T TAC T CATCAC TCAG TGAGCATC TAC T T T GGG T GCAT TC TAGT GTAGT TACTAG T CT T 

TTGACATGGATGAT TCTGAGGAGGAAGC TGTTATTGAAT GTATAGAGATT TATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 1*6 



MRERPRLGEDSSLISLFLQWAFLi^IWGTHTYSHWPSCCPSKGQDTSEELLRV?STVPYPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 



o 



N-glycosylation site. 



01 



amino acids 136-140 



Tyrosine kinase phosphorylation site* 



amino acids 127-135 



N-myristoylation sites. 



m 



amino acids 44-50, 150-156 
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FIGURE 1*7 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGACCT CC GAGTAGAACC TGT T ACAACTAGTGT TGCAACAGGGGAC TATT CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAGT CTATT T CAT TGGGGCCCAT AATAT T CC T AAT GCAAATATGAAT GAAGAT GG 
CCC TT CCAT G T C T GTGAAT T T CACC TCACCAGGC T GCC TAGACCACATAATGAAAT ATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCC TGGGAAACAGATACATGGCTCT TATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAAC7\AACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CAT GG GT GC T G GT G GC AGGGAT C TAT C T AAT G T G G AGG C AC GAAAGGAT C AAGAAGAC T T C C T T T 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GT GATC TAAGAAGCCAGAT T CATC T GCACAAATACGT GGT GGT C TAC T T TAGAGAGAT T GAT ACA 
AAAGAC GAT TACAATGC T CT CAGT G TC T GCCC CAAGT AC CAC CTCATGAAGGAT GCCACTGCT T T 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE is8 



w 

Sib?: 



r :;:; i 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSW 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNE DG P SMS W FT S PGCLDH IMK YKKKC VKAGS LWDPN I TACKKNE E T VE VN FTT T PLGNRYMAL IQ H 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPY FPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 2 90-309 



]|| N-glycosylation sites. 

O amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 

III - 287 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 



N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAACATGA.CAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTAGACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGC T GCAT CAAT GC T CAAGGAAAGGAAGACAT C TCCATGAAT T CCGT TCCCAT CCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAA GAGGTGCATATCC 
AC T C AGCT GAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLS I LGLAFLSEAAARKI PKVGHT FFQKPESCPPVPGGSMKLDI GI INEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC TCC GTGC T GGAAG TAGGAGGAGAGT CAGGAC TCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAG ATGC CTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGAT GAGGAAAAGT T T GGAGGAGCAG C TGACTCAGGGGTGGAGGAGCCT AGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
H TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
□ TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
PI GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
III ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
r ! GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
UJ CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
If! CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
1 3 1 CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
3 AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
p CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
Ifl GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
S| CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
Jf! CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
I U CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
p CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
in CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVL1WS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
L WQ AARLRLLT LQSWLLDAP CS L P AE AALCWRAPGG D PCQ PL VP PL SWEN VT VDKVL E FPLL KG HPNLC VQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQ CL QL WD DDLG AL WAC PMDKY I H KRWAL VWLACLL FAAAL S L I LL L KKD HAKG WLRL L KQDVRS GAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGWVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 

GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
G T CAGTGC GGGAGG CCGGTCAGCCACCAAGATGACTGACAGG T TCAGC TC TC T GCAGCACAC TAG 
CC T CAAGCCACC T GAT GT GACCT G TAT C TC CAAAGT GAGAT CGAT T CAGAT GAT T GT TCAT CCT A 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TCTACCAC TTAGAGCT CCAGGTCAACCGCACCTACCAAATGCACCTTGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
^ CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
Q TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
Q CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
m TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
71 AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
*W T CCACAGCGGCATAGC CT GT CCGAGAT CACC T AC T T AGGGCAGCCAGACATC T CCAT CCT CCAGC 

Ul CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
US GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
J£ GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
V CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
C3 T TC TC TGCAGGAGG TGACCT CC T T GGC TATGGAGGAATC CCAAGAAGCAAAATCAT T GCACCAGC 

111 CCC TGGGGAT T T GCACAGACAGAACAT C TGACCCAAATG TGCTACACAGT GGGGAGGAAGGGACA 

p| CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
St CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
l;f TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
13 GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
f|| AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAGT GCAAGGAGGAAAT GCAGGGAAACT CCCGAGG T CCAGAGCC CCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCC TGGAAAAGAAC C AGGCC T GGAAAAGAACCAGAAGGAGGC T GGGCAGAAC CAGAACAAC CT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TT CCCAGCCAGGGCAAC TGC C TGACGTT GCACGAT T TCAGCT T CAT T CC T C TGATAGAACAAAGC 
GAAATGCAGGT CCACCAGGGAGGGAGACACACAAGCCT T T TC T GCAGGCAGGAGT T T CAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
C TCACCAC TCACAAGCT GTG T GAC T T CAAACAAAT GAAAT CAGT GCCCAGAACC T CGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 

EFLGT IMICVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFL FSMGFLVAVLCYLS YRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATCG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGTT TTGGATATT TTCCAAAAGATT T GATCAAGGTACT TCATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACT TT GTCT GC TT TGAAGGAGGAAGAGATGAT T 
TTAATAGTTATAAT GTAGAAGAGCT TT TAGGATCT TTGGAACTGGAGGACTCTGTACCTGAAGAG 
m TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
f|l ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
III CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGC T TACAAAG T CCTGAAAACAGAAAT GAGT CAGAGAGGAAGT GGACAG T GCGT TAT T 
CAT TACAGCAAAGGAT T T CGT TGGCATCAAAATC TAAGT T T GTT T TACAAAGAT TGT T T T TAG TA 
C TAAGC T GCC T TGGCAGT TT GCAT T TT T GAGCCAAACAAAAATATAT TAT T T T CCC T TCTAAGTA 

J:? 1 AAAAAAAAAAAAAAAAAAAA 

[II 

CI 

Hi 



ill 
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FIGURE 166 

MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMS QRGS GQCVI H YS KGFRWHQNL S L FYKDC F 

Important features of the protein: 
Signal peptide: 



n amino acids 1-22 



N-glycosylation site. 

amino acids 294-298 



B cAMP- and cGMP-dependent protein kinase phosphorylation site. 

D amino acids 30-34 



Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites ♦ 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
s . GCC TTGAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGT TGCAAAACCTGACAAAACCCGGG 

^ ACAGGGAT T T CT GATGGT GAT T TC TGGATAGGGC T T TGGAGGAATGGAGATGGGCAAACAT CT GG 

CI T GCCT GC CCAGAT C TC TACCAGTGG TC TGATGGAAGCAAT TC CCAGTACC GAAACT GGTACACAG 

O ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
p| C T T GGGGGTCCCTACC T T TACCAGT GGAATGATGACAGGT G T AACATGAAGCACAAT TAT AT T TG 

Hi CAAGTATGAACCAGAGAT TAATCCAACAGCCCCT GTAGAAAAGCC TTATCTTACAAATCAACCAG 

GAGAC AC C CAT C AG AAT GTGGTTGT TAC T GAAG C AG G T AT AAT T C C C AAT C T AAT T TAT G T T G T T 
W ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
Ul GCATAAAAGTAAAGGAAGAACAAAAACTAG TC CAAACCAGTCTACAC TGT GGAT T T CAAAGAGTA 

j; CCAGAAAAGAAAGT GGCAT GGAAGT ATAA TAAC T CATT GACT T GGTTCCAGAAT TT T GTAAT TC T 

GGATCTGTATAAGGAAT GGCATCAGAACAATAGC T T GGAAT GGCT TGAAATCAC AAAGGAT CTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
w TT TAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCT TATT TTGCTAAAGGATGCACCCAA 

Ul ACT TCAAAC T TCAAGCAAAT GAAAT GGACAAT GCAGATAAAGT TGT TAT CAACACGT CGGGAGTA 

Q TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
pi TGTATAT TG TATT GAAATT TACAGT GTGCAAAAGTAT T T TACC TT TGCATAAG TGT TTGATAAAA 

J.^ ATGAACTGT TCTAATAT TTATTTTTATGGCATCTCATTT TTCAATACATGCTC T TT TGAT TAAAG 

W AAAC T TAT TACTG T TGT CAACTGAAT T CACACACACACAAATATAGTACCAT AGAAAAAGT T TGT 

f|| TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTAT T TCAT T AAGT GTGATATAAACC T CCT CAAACAT T TTAC T TAGAGGCAAGGAT 
TGTCTAATT TCAATTGT GCAAGACATGTGCCT TATAATTATTT TTAGCTTAAAATTAAACAGAT T 
T TG TAATAAT G TAACT T TGT TAATAGGT GCAT AAACACTAATGCAG T CAAT T T GAACAAAAGAAG 
TGACATACAC7UVTATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TATAT TAGTATACAAAGAGGTCATGTGGT T GAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT TT TAGGAAGGAAAG GAAC TAC GAAAT CGTGT GAAAAT GGGTT GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 



pi 



3 Transmembrane domain: 

amino acids 214-235 

n 

I N-glycosylation sites. 

amino acids 86-89, 255-258 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 26 6-269 

3::: ;t 

PJ N-myristoylation sites. 

FU amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 

145, 212-217 
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